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5 Summary of the invention 

This invention relates to peptides which are fragments of 
protein products arising from frameshift mutations in 
genes, which peptides elicit T cellular immunity, and to 
0 cancer vaccines and compositions for anticancer treatment 
comprising said peptides. 



The invention further relates to a method for identifying 
such peptides which are fragments of protein products 
15 arising from frameshift mutations in genes, which may 

elicit T cellular immunity which is useful for combating 
cancer associated with said mutated genes. 

The invention also relates to DNA sequences encoding at 
20 least one frameshift mutant peptide, and vectors 

comprising at least one insertion site containing a DNA 
sequence encoding at least one frameshift mutant 
peptide. 

25 Further the invention relates to methods for the 

treatment or prophylaxis of cancers associated with 
frameshift mutations in genes by administration of at 
least one frameshift mutant peptide or a recombinant 
virus vector comprising at least one insertion site 

30 containing a DNA sequence encoding at least one 

frameshift mutant peptide, or an isolated DNA sequence 
comprising a DNA sequence encoding at least one 
frameshift mutant peptide. 

35 The present invention represents a further development of 
anticancer treatment or prophylaxis based on the use of 
peptides to generate activation and strengthening of the 



anti cancer activity of the T cellular arm of the body's 
own immune system. 



5 Technical Background 

Tumour antigens, Status: 

T cell defined antigens have now been characterised in a 

10 broad spectrum of cancer types. These antigens can be 

divided into several main groups, depending on their 

4 . x. x. -i ^ * rettiUC 

expression. The two main groups are constituted by 1 

.developmental differentiation related antigens 

(tumour- testis antigens, oncofoetal antigens etc., such 

15 as MAGE antigens and CEA) and tissue specific 

differentiation antigens (Tyrosinase, gplOO etc.). The 
group containing the truly tumour specific antigens 
contains proteins that are altered due to mutations in 
the genes encoding them. In the majority of these, the 

20 mutations are unique and have been detected in a single 
or in a small number of tumours. Several of these 
antigens seem to play a role in oncogenesis. 



25 



Cancer vaccines, Status 



The focus in cancer vaccine development has been on 
antigens expressed in a high degree within one form of 
cancer (such as melanoma) or in many kinds of cancers. One 
reason for this is the increased recruitment of patients 
30 into clinical protocols. The field is in rapid growth, 

illustrated by the accompanying table listing the cancer 
vaccine protocols currently registered in 
the PDQ database of NCI. 
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Inheritable cancer/cancer gene testing: 

Inherited forms of cancer occur at a certain frequency in 
the population. For several of these cancer forms, the 
underlying genetic defects have been mapped. This is also 
the case in Lynch syndrome cancers which constitute an 
important group of inheritable cancer. In families 
inflicted with this syndrome, family members inherit defect 
genes encoding DNA Mismatch Repair (MMR) Enzymes. Carriers 
of such MMR defects frequently develop colorectal cancer 
(HNPCC) and other forms of cancer (list?) . Mutations in MMR 
enzymes can be detected using gene testing in the same way 
as other cancer related genes can be detected. 

Gene testing of risk groups in this case represents an 
ethical dilemma, since no acceptable forms for prophylactic 
treatment exist. At present surgery to remove the organ in 
danger to develop cancer has been the only treatment 
option. In these patients, cancer vaccines will be a very 
(interesting) form of prophylaxis, provided efficient 
vaccines can be developed. 

The lack of efficient repair of mismatched DNA results in 
deletions and insertions in one strand of DNA, and this 
happens preferentially in stretches of DNA containing 
repeated units (repeat sequences) . Until now, focus has 
been on repeat sequences in the form of non-coding 
microsattelite loci. Indeed microsattelite instability is 
the hallmark of cancers resulting from MMR defects. We 
have taken another approach, and have concentrated on 
frameshift mutations occurring in DNA sequences coding for 
proteins related to the oncogenic process. Such frameshift 
mutations result in completely new amino acid sequences 
in the C-terminal part of the proteins, prematurely 
terminating where a novel stop codon appears. This results 
in two important consequences : 
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1) The truncated protein resulting from the frameshift is 
generally nonfunctional, in most cases resulting in 
"knocking out" of an important cellular function. Aberrant 
proteins may also gain new functions such as the capacity 

5 to aggragate and form plaques. In both cases the frameshift 
results in disease. 

2) The short new C- terminal amino acid sequence resulting 
from the shift in the reading frame (the "frameshift 

10 sequence") is foreign to the body. It does not exist prior 
to the mutation, and it only exists in cells having the 
mutation, i.e. in tumour cells and their pre malignant 
progenitors. Since they are completely novel and 
therefore foreign to the immune system of the carrier, they 

15 may be recognised by T-cells in the repertoire of the 
carrier. So far, nobody has focused on this aspect of 
frameshift mutations, and no reports exist on the 
characterisation of frameshift peptides from coding regions 
of proteins as tumour antigens. This concept is therefore 

20 novel and forms the basis for developing vaccines based on 
these sequences. It follows that such vaccines may also be 
used prophyllactively in persons who inherit defective 
enzymes belonging to the MMR machinery. Such vaccines will 
therefore fill an empty space in the therapeutic 

25 armament against inherited forms of cancer. 

It has been shown that single amino acid substitutions in 
intracellular "self "-proteins may give rise to tumour 

30 rejection antigens, consisting of peptides differing in 
their amino acid sequence from the normal peptide. The T 
cells which recognise these peptides in the context of 
the major histocompatibility (MHC) molecules on the 
surface of the tumour cells, are capable of killing the 

35 tumour cells and thus rejecting the tumour from the host. 



In contrast to antibodies produced by the B cells, which 
typically recognise a free antigen in its native 
conformation and further potentially recognise almost any 
site exposed on the antigen surface, T cells recognise an 
antigen only if the antigen is bound and presented by a 
MHC molecule. Usually this binding will take place only 
after appropriate antigen processing, which comprises a 
proteolytic fragmentation of the protein, so that the 
resulting peptide fragment fits into the groove of the 
MHC molecule. Thereby T cells are enabled to also 
recognise peptides derived from intracellular proteins. T 
cells can thus recognise aberrant peptides derived from 
anywhere in the tumour cell, in the context of MHC 
molecules on the surface of the tumour cell, and can 
subsequently be activated to eliminate the tumour cell 
harbouring the aberrant peptide. 

M.Barinaga, Science, 257, 880-881, 1992 offers a short 
review of how MHC binds peptides. A more comprehensive 
explanation of the Technical Background for this 
Invention may be found in D. Male et al, Advanced 
Immunology, 1987, J.B. lippincott Company, Philadelphia. 
Both references are hereby included in their entirety. 

The MHC molecules in humans are normally referred to as 
HLA (human leukocyte antigen) molecules. They are encoded 
by the HLA region on the human chromosome No 6. 

The HLA molecules appear as two distinct classes 
depending on which region of the chromosome they are 
encoded by and which T cell subpopulations they interact 
with and thereby activate primarily. The class I 
molecules are encoded by the HLA A, B and C subloci and 
they primarily activate CD8+ cytotoxic T cells. The HLA 
class II molecules are encoded by the DR, DP and DQ 
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subloci and primarily activate CD4+ T cells, both helper 
cells and cytotoxic cells. 

Normally every individual has six HLA Class I molecules, 
5 usually two from each of the three groups A, B and C. 

Correspondingly, all individuals have their own selection 
of HLA Class II molecules, again two from each of the 
three groups DP, DQ and DR. Each of the groups A, B, C 
and DP, DQ and DR are again divided into several 
10 subgroups. In some cases the number of different HLA 

Class I or II molecules is reduced due to the overlap of 
two HLA subgroups. 

All the gene products are highly polymorphic. Different 
15 individuals thus express distinct HLA molecules that 

differ from those of other individuals. This is the basis 
for the difficulties in finding HLA matched organ donors 
in transplantations. The significance of the genetic 
variation of the HLA molecules in immunobiology is 
20 reflected by their role as immune-response genes. Through 
their peptide binding capacity, the presence or absence 
of certain HLA molecules governs the capacity of an 
individual to respond to peptide epitopes. As a 
consequence, HLA molecules determine resistance or 
25 susceptibility to disease. 

T cells may control the development and growth of cancer 
by a variety of mechanisms. Cytotoxic T cells, both HLA 
class I restricted CD8+ and HLA Class II restricted CD4+, 
30 may directly kill tumour cells carrying the appropriate 
tumour antigens. CD4+ helper T cells are needed for 
cytotoxic CD8+ T cell responses as well as for antibody 
responses, and for inducing macrophage and LAK cell 
killing. 

35 

A requirement for both HLA class I and II binding is that 
the peptides must contain a binding motif, which usually 



is different for different HLA groups and subgroups. A 
binding motif is characterised by the requirement for 
amino acids of a certain type, for instance the ones 
carrying large and hydrophobic or positively charged side 
groups, in definite positions of the peptide so that a 
narrow fit with the pockets of the HLA binding groove is 
achieved. The result of this, taken together with the 
peptide length restriction of 8-10 amino acids within the 
binding groove, is that it is quite unlikely that a 
peptide binding to one type of HLA class I molecules will 
also bind to another type. Thus, for example, it may very 
well be that the peptide binding motif for the HLA-A1 and 
HLA-A2 subgroups, which both belong to the class I 
gender, are as different as the motifs for the HLA-A1 and 
HLA-B1 molecules. 

For the same reasons it is not likely that exactly the 
same sequence of amino acids will be located in the 
binding groove of the different class II molecules. In 
the case of HLA class II molecules the binding sequences 
of peptides may be longer, and it has been found that 
they usually contain from 10 to 16 amino acids, some of 
which, at one or both terminals, are not a part of the 
binding motif for the HLA groove. 

However, an overlap of the different peptide binding 
motifs of several HLA class I and class II molecules may 
occur. Peptides that have an overlap in the binding 
sequences for at least two different HLA molecules are 
said to contain "nested T cell epitopes". The various 
epitopes contained in a "nested epitope peptide" may be 
formed by processing of the peptide by antigen presenting 
cells and thereafter be presented to T cells bound to 
different HLA molecules. The individual variety of HLA 
molecules in humans makes peptides containing nested 
epitopes more useful as general vaccines than peptides 



8 



that are only capable of binding to one type of HLA 
molecule - 

Effective vaccination of an individual can only be 
achieved if at least one type of HLA class I and/or II 
molecule in the patient can bind a vaccine peptide either 
in it's full length or as processed and trimmed by the 
patient's own antigen presenting cells. 

The usefulness of a peptide as a general vaccine for the 
majority of the population increases with the number of 
different HLA molecules it can bind to, either in its 
full length or after processing by antigen presenting 
cells . 

In order to use peptides derived from a protein encoded 
by a mutated gene as vaccines or anticancer agents to 
generate anti tumour CD4+ and/or CD8+ T cells, it is 
necessary to investigate the mutant protein in question 
and identify peptides that are capable, eventually after 
processing to shorter peptides by the antigene presenting 
cells, to stimulate T cells. 



Prior art 

In our International Application PCT/N092/ 00032 
(published as W092/14756) , we described synthetic 
peptides and fragments of oncogene protein products which 
have a point of mutation or translocations as compared to 
their proto-oncogene or tumour suppressor gene protein. 
These peptides correspond to, completely cover or are 
fragments of the processed oncogene protein fragment or 
tumour suppressor gene fragment as presented by cancer 
cells or other antigen presenting cells, and are 
presented as a HLA-peptide complex by at least one allele 
in every individual. These peptides were also shown to 
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induce specific T cell responses to the actual oncogene 
protein fragment produced by the cell by processing and 
presented in the HLA molecule. In particular, we 
described peptides derived from the p21 ras protein which 
5 had point mutations at particular amino acid positions, 
namely position 12, 13 and 61. These peptides have been 
shown to be effective in regulating the growth of cancer 
cells in vitro. Furthermore, the peptides were shown to 
elicit CD4+ T cell immunity against cancer cells 

10 harbouring the mutated p21 ras oncogene protein through 
the administration of such peptides in vaccination or 
cancer therapy schemes. Later we have shown that these 
peptides also elicit CD8+ T cell immunity against cancer 
cells harbouring the mutated p21 ras oncogene protein 

15 through the administration mentioned above, ^ rOJ^l 

However, the peptides described above will be useful only 
in certain number of cancers, namely those which involve 
oncogenes with point mutations or translocation in a 
20 proto-oncogene or tumour suppressor gene. There is 

therefore a strong need for an anticancer treatment or 
vaccine which will be effective against a more general 
range of cancers. 

25 In general, tumors are very heterogenous with respect to 
genetic alterations found in the tumour cells. This 
implies that both the potential therapeutic effect and 
prophylactic strength of a cancer vaccine will increase 
with the number of targets that the vaccine is able to 

30 elicit T cell immunity against. A multiple target vaccine 
will also reduce the risk of new tumour formation by 
treatment escape variants from the primary tumour. 



35 
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Definition of Problem solved bv the Invention, 

There is a continuing need for new anticancer agents based 
on antigenic peptides giving rise to specific T cellular 
5 responses and toxicity against tumours and cancer cells 
carrying genes with mutations related to cancer. The 
present invention will contribute largely to supply new 
peptides that can have a use in the combat and prevention 
of cancer as ingredients in a multiple target anti-cancer 
10 vaccine. 

Another problem solved by the present invention is that 
a protection or treatment can be offered to the 
individuals belonging to family's or groups with high 

15 risk for hereditary cancers. Hereditary cancers arie in 
many cases associated with genes susceptible to 
frameshift mutations as described in this invention (i.e. 
mutations in mismatch repair genes) . Today it is possible 
to diagnose risk of getting hereditary cancer but no 

20 pharmaceutical method for protection against the onset of 
the cancer is available. 

Definition of the Invention 

25 

A main object of the invention is to obtain peptides 
corresponding to peptide fragments of mutant proteins 
produced by cancer cells which can be used to stimulate T 
cells. 

30 

Another main object of the invention is to develop a 
cancer therapy for cancers based on the T cell immunity 
which may be induced in patients by stimulating their T 
cells either in vivo or in vitro with the peptides 
35 according to the invention. 
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A third main object of the invention is to develop a 
vaccine to prevent the establishment of or to eradicate 
cancers based solely or partly on peptides corresponding 
to peptides of the present invention which can be used to 
5 generate and activate T cells which produce cytotoxic T 
cell immunity against cells harbouring the mutated genes. 

A fourth main object of the invention is to design an 
anticancer treatment or prophylaxis specifically adapted 
10 to a human individual in need of such treatment or 

prophylaxis, which comprises administering at least one 
peptide according to this invention. 

These and other objects of the invention are achieved by 
15 the attached claims. 

Since frameshift mutations result in premature stop 
codons and therefore deletion in large parts of the 
proteins, proteins with frameshift mutations have 
generally not been considered to be immunogenetic and 
have therefore not been considered as targets for 
immunotherapy. Thus it has now surprisingly been found 
that a whole group of new peptides resulting from 
frameshift mutations in tumour relevant genes are useful 
for eliciting T cell responses against cancer cells 
harbouring genes with such frameshift mutations. 

Genes containing a mono nucleoside base repeat sequence 
of at least five residues, for example of eighth 
deoxyadenosine bases (AAAAAAAA) , or a di-nucleoside base 
repeat sequence of at least fswir di-nucleoside base 1f 5e &w 

units, for example of two deoxyadenosine-deoxycytosine 
units (ACAC), are susceptible to frameshift mutations. 
The frameshift mutations occur, respectively, either by 
insertion of one or two of the mono-nucleoside base 
residue or of one or two of the di-nucleoside base unit 



20 



25 



30 



35 
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in the repeat sequence, or by deletion of one or two of 
the mono-nucleoside base residue or of one or two of the 
di-nucleoside base unit from the repeat sequence. A gene 
with a frameshift mutation will from the point of 
5 mutation code for a protein with a new and totally 

different amino acid sequence as compared to the normal 
gene product. This mutant protein with the new amino acid 
sequence at the carboxy end will be specific for all 
cells carrying the modified gene. 

10 

In the remainder of this specification and claims the 
denomination frameshift mutant peptides will comprise 
such proteins and peptide fragments thereof. 

15 It has now according to the present invention been found 
that such new protein sequences arising from frameshift 
mutations in genes in cancer cells give rise to tumour 
rejection antigens that are recognised by T cells in the 
context of HLA molecules. 

20 

It has further according to the present invention been 
found a group of peptides corresponding to fragments of 
mutant proteins arising from frameshift mutations in 
genes in cancer cells which can be used to generate T 
25 cells. The said peptides can therefore also be used to 

rise a T cell activation against cancer cells harbouring 
a gene with a frameshift mutation as described above. 

These peptides are at least 8 amino acids long and 
30 correspond, either in their full length or after 

processing by antigen presenting cells, to the mutant 
gene products or fragments thereof produced by cancer 
cells in a human patient afflicted with cancer. 

35 A peptide according to this invention is characterised in 
that it 



a) is at least 8 amino acids long and is a 
fragment of a mutant protein arising from a 
frameshift mutation in a gene of a cancer cell; 

and 

b) consists of at least one amino acid of the 
mutant part of a protein sequence encoded by said 
gene ; 

and 

c) comprises 0-10 amino acids from the carboxyl 
terminus of the normal part of the protein 
sequence preceding the amino terminus of the 
mutant sequence and may further extend to the 
carboxyl terminus of the mutant part of the 
protein as determined by a new stop codon 
generated by the frameshift mutation in the gene; 

and 

d) induces, either in its full length or after 
processing by antigen presenting cell, T cell 
responses . 



The peptides of this invention contain preferably 8-25, 
9-20, 9-16, 8-12 or 20-25 amino acids. They may for 
instance contain 9, 12, 13, 16 or 21 amino acids. 



It is most preferred that the peptides of the present 
invention are at least 9 amino acids long, for instance 
9-18 amino acids long, but due to the processing 
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possibility of the antigen presenting cells also longer 
peptides are very suitable for the present invention. Thus 
the whole mutant amino acid sequence may be used as a 
frameshift mutant peptide according to the present 
5 invention, if it comprises 8 amino acids or more. 

The invention further relates to a method for vaccination 
of a person disposed for cancer, associated with a 
frameshift mutation in a gene, consisting of administering 
10 at least one peptide of the invention one or more times in 
an amount sufficient for induction of T-cell immunity to 
the mutant proteins encoded by the frameshift mutated gene. 

The invention also relates to a method for treatment of a 
15 patient afflicted with cancer associated with frameshift 

mutation in genes, consisting of administering at least one 
peptide of the invention one or more times in an amount 
sufficient for induction of T-cell immunity to mutant 
proteins arising from frameshift mutations in the genes of 
20 cancer cells. 

Furthermore, it has according to the present invention been 
found a method for identifying new peptides which 
correspond to fragments of proteins arising from frameshift 
25 mutations in genes. This method is characterised by the 
following steps: 

1) identifying a gene in a cancer cell susceptible 
to frameshift mutation by having a mono nucleoside 
30 base repeat sequence of at least five residues, or 

a di-nucleoside base repeat sequence of at least 
four di-nucleoside base units; 

and 

35 
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2) removing, respectively, one nucleoside base 
residue or one di-nucleoside base unit from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

and/or 

3) removing, respectively, two nucleoside base 
residues or two di-nucleoside base units from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

and/or 

4) inserting, respectively, one nucleoside base 
residue or one di-nucleoside base unit in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

and/or 

5) inserting, respectively, two nucleoside base 
residues or two di-nucleoside base units in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon. 

In order to determine whether the peptides thus 
identified are useable in the compositions and methods 
according to the present invention for the treatment or 
prophylaxis of cancer, the following further step should 
be performed: 
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6) determining whether the new peptide, either in 
their full length or as shorter fragments of the 
peptides, are able to stimulate T cells. 

5 

Optionally a further step may be added as follows: 

7) determining peptides containing nested epitopes 
for different major HLA class I and/or HLA class II 

10 molecules. 



Detailed Description of the invention. 

15 

In the present description and claims/ the amino acids 
are represented by their one letter abbreviation as known 
in the art. 

20 The peptides of the present invention shall be explicitly 
exemplified through two different embodiments, wherein 
cancer develops based on frameshift mutations in specific 
genes, namely the BAX gene and TGFpRII gene: 

25 I) BAX gene 

It has been established that the BAX gene is involved in 
regulation of survival or death of cells by promoting 
apoptosis. The human BAX gene contains a repeat sequence 
of eight deoxyguanosine bases (G8) in the third exon, 

30 spanning codons 38 to 41 (AT G GGG GGG GA G) . 

Frameshift mutations in this G8 repeat have been 
observed, both as G7 (AT G GGG GGG AGG) and G9 (AT G GGG 
GGG GGA ) repeats, both in colon cancer cells and prostate 
35 cancer cells. The occurency is more than 50% of the 

examined cases (Rampino, N. et al., "Somatic frameshift 
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mutations in the BAX gene in colon cancers of the 
microsatellite mutator phenotype . Science (Washington 
DC), 275: 967-969, 1997). The modified BAX gene products 
are unable to promote apoptosis and thus makes further 
tumour progress possible. Furthermore the modified gene 
products are only found in cancer cells and are therefore 
targets for specific immunotherapy. 

According to the present invention, peptides 
corresponding to the transformed BAX protein products 
arising from frameshift mutations in the BAX gene can be 
used as anticancer therapeutical agents or vaccines with 
the function to trigger the cellular arm of the immune 
system (T-cells) against cancer cells in patients 
afflicted with cancers associated with a modified BAX 
gene . 

Frameshift mutations in the BAX gene result in mutant 
peptide sequences with the first amino acid of the 
altered sequence in position 41 as compared to the normal 
BAX protein (Table 1, seq.id. no. 1 to 4) . 

Table 1 

amino acid pos 41 51 61 71 

normal bax peptide ; EAPELALDPV PQDASTKKLS ECLKRIGDEL 
DS. . . 

seq id no 1 (bax-lG) ; RHPSWPWTRC LRMRPPRS 
seq id no 4 (bax+2G) ; GRHPSWPWTR CLRMRPPRS 

seq id no 2 (bax-2G) ; GTRAGPGPGA SGCVHQEAER VSQAHRGRTG Q 
seq id no 3 (bax+lG) ; GGTRAGPGPG ASGCVHQEAE RVS QAHRGRT GQ 

Table 2 shows one group of peptides according to the 
present invention : 



Table 2 

seq . id . no . 5 : IQDRAGRMGGRHPSWPWTRCLRMRPPRS 



seq.id.no. 6 
seq.id.no. 7 
seq.id.no. 8 
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I QDRAGRMGGGRHPSWPWT 

I QDRAGRMGGGGTRAG PGPGAS GCVHQEAERVS QAHRGRTGQ 
I QDRAGRMGGGTRAGPGPG 



5 The peptides listed in Table 3 were used for in vitro 

generation of T cells that recognise mutant BAX peptides. 

Table 3. 

seq id no 1: RHPSWPWTRCLRMRPPRS 

10 seq id no 9: I QDRAGRMGGRHPSWPWTRCLR 

seq id no 6: I QDRAGRMGGGRHPSWPWT 

seq id no 10: ASGCVHQEAERVSQAHRGRTGQ 

seq id no 11: GGTRAGPGPGASGCVHQEAERV 

seq id no 12: IQDRAGRMGGGGTRAGPGPGAS 

15 seq id no 8: I QDRAGRMGGGTRAGPGPG 

The most preferred peptides according to this embodiment 
of the present invention are listed in Table 4 : 



20 


Table 4 








seq 


id 


no 


1: 


RHPSWPWTRCLRMRPPRS 




seq 


id 


no 


2: 


GTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 




seq 


id 


no 


3: 


GGTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 




seq 


id 


no 


4: 


GRHPSWPWTRCLRMRPPRS 


25 


seq. 


id. 


no . 


5: 


I QDRAGRMGGRHPSWPWTRCLRMRPPRS 




seq. 


id. 


no . 


6: 


I QDRAGRMGGGRHPSWPWT 




seq. 


id. 


no . 


7: 


I QDRAGRMGGGGTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 




seq 


id 


no 


8: 


I QDRAGRMGGGTRAGPGPG 




seq 


id 


no 


9: 


I QDRAGRMGGRHPSWPWTRCLR 


30 


seq 


id 


no 


10: 


ASGCVHQEAERVSQAHRGRTGQ 




seq 


id 


no 


11: 


GGTRAGPGPGASGCVHQEAERV 




seq 


id 


no 


12: 


I QDRAGRMGGGGTRAGPGPGAS 



2) TGFPRII 

35 It has been established that the TGF0RII gene is involved 
in regulation of cell growthys.TjSFPRII is a receptor for 
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TGFP which down regulates cell growth. The human gene 
coding for TGFPRII contains a repeat sequence of ten 
deoxyadenosine bases (A10) from base no. 709 to base no. 
718 ( GAA AAA AAA AAG CCT) . In colon cancers and 
pancreatic cancers frameshift mutations in this A10 
repeat have been observed, both as A9 ( GAA AAA AAA AG C 
CT) and All ( GAA AAA AAA AAA GCC) repeats, in 
approximately 80 % of the examined cases (Yamamoto, H., 
"Somatic frameshift mutations in DNA mismatch repair and 
proapoptosis genes in hereditary nonpolyposis colorectal 
cancer.", Cancer Research 58, 997-1003, March 1, 1998). 
The modified TGFpRII gene products are unable to bind 
TGFP and the signal for down regulation of cell growth 
is eliminated and thus makes further tumour progress 
possible. Furthermore the modified gene products are only 
found in cancer cells and are therefore targets for 
immunotherapy . 

Consequently peptides corresponding to the transformed 
TGFpRII protein products arising from frameshift 
mutations in the TGFPRII gene can be used as anticancer 
therapeutical agents or vaccines with the function to 
trigger the cellular arm of the immune system (T-cells) 
against cancer cells in patients afflicted with cancers 
associated with a modified TGFPRII gene. 

Frameshift mutations in the TGFpRII gene result in mutant 
peptide sequences with the first amino acid of the 
altered sequence in either position 133 (one and two base 
deletions) or 134 (one and two base insertions) as 
compared to the normal TGFPRII protein (Table 5, seq id 
no 13 and 21) . 
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Table 5. 

amino acid pos. 

normal TGFpRII 

seq id no 13 (-1A) 
seq id no 13 (+2A) 

TGFbRII + 1A) 
TGFbRII - 2A) 



133 

K PGETFFMCSC SSDECNDNII FSEEYNTSNP 
DLLL 

S LVRLSSCVPV ALMSAMTTSS SQKNITPAIL TCC 
SLVRLSSCVP VALMSAMTTS SSQKNITPAI 

LTCC 
• AW 
r A W 



Table 6 shows one groups of peptides of this invention: 



Table 6 

seq id no 14 : SPKCIMKEKKSLVRLSSCVPVALMSAMTTSSSQKNITPAILTCC 
15 seq id no 15 : PKCIMKEKKKSLVRLSSCV 
seq id no 19 : SPKCIMKEKKAW 
seq id no 20 : PKCIMKEKKKAW 

Table 7 presents peptides that were used for in vitro 
20 generation of T cells that recognise mutant TGF0RII 
peptides . 





Table 7 








seq 


id 


no 


15: 


PKCIMKEKKKSLVRLSSCV 


25 


seq 


id 


no 


16: 


ALMSAMTTSSSQKNITPAILTCC 




seq 


id 


no 


17: 


SLVRLSSCVPVALMSAMTTSSSQ 




seq 


id 


no 


18: 


S PKCIMKEKKSLVRLSSCVPVA 




seq 


id 


no 


19: 


SPKCIMKEKKAW 




seq 


id 


no 


20: 


PKCIMKEKKKAW 


30 


seq 


id 


no 


21: 


AMTTSSSQKNITPAILTCC 




seq 


id 


no 


428: 


SLVRLSSCV 
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The most preferred peptides of this embodiment of the 
present invention are: 
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Table 8 

seq id no 13 : SLVRLSSCVPVALMSAMTTSSSQKNITPAILTCC 

seq id no 14:SPKCIMKEKKSLVRLSSCVPVALMSAMTTSSSQKNITPAILTCC 

seq id no 15 : PKCIMKEKKKSLVRLSSCV 

seq id no 1 6 : ALMSAMTTSSSQKNITPAILTCC 

seq id no 17:SLVRLSSCVPVALMSAMTTSSSQ 

seq id no 18:SPKCIMKEKKSLVRLSSCVPVA 

seq id no 1 9 : S PKC IMKEKKAW 

seq id no 2 0 : PKC IMKEKKKAW 

seq id no 21 :AMTTSSSQKNITPAILTCC 

seq id no428 : SLVRLSSCV 



Other peptides of the invention can be fragments of the 
peptides listed in the Tables 1-8 above. Such fragments 
are most preferred from 9-16 amino acids long and include 
at least one amino acid from the mutant part of the 
protein. 



As used in this description and claims the term fragment 
is intended to specify a shorter part of a longer peptide 
or of a protein. 

Other cancer associated genes containing repeat sequences 
of a nucleoside base and which therefore are susceptible 
to frameshift mutations and consequently are potential 
candidates for peptides according to the present 
invention (seq id nos according to table 9 are given in 
parentheses in each case) are the following: 

Human TGF-p-2 (hTGFp2) gene (seq id nos 22-29) 

Deleted in colorectal cancer (DCC) gene (seq id nos 
30-34) 

Human breast and ovarian cancer susceptibility ( BRCA1 ) 
gene (seq id nos 378-387) 
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Human breast cancer susceptibility (BRCA2) gene (seq id 
nos 35-94) 

Human protein tyrosine phosphatase (hPTP) gene (seq id 
nos 95-102) 

Human DNA topoisomerase II (top2) gene (seq id nos 
103-108) 

Human kinase (TTK) gene (seq id nos 109-120) 

Human transcriptional repressor (CTCF) gene (seq id nos 

121-127) 

Human FADD-homologous ICE/CED-3-like protease gene (seq 
id nos 128-133) 

Human putative mismatch repair/binding protein (hMSH3) 
gene (seq id nos 134-147) 

Human retinoblastoma binding protein 1 isoform I (hRBPl) 
gene (seq id nos 148-156) 

Human FMRl (hFMRl) gene (seq id nos 157-161) 
Human TINUR gene (seq id nos 162-169) 
b-raf oncogene (seq id nos 170-175) 

Human neurof ibromin (NF1) gene (seq id nos 176-181) 

Human germline n-myc gene (seq id nos 182-188) 

Human n-myc gene (seq id nos 189-194) 

Human ras inhibitor gene (seq id nos 195-199) 

Human hMSH6 gene (seq id nos 200-203 and 293-297) 

Human nasopharynx carcinoma EBV BNLF-1 gene (seq id nos 

204-210) 

Human cell cycle regulatory protein (ElA-binding protein) 
p300 gene (seq id nos 211-218) 

Human B-cell lymphoma 3-encoded protein (bcl-3) gene 
(seq id nos 219-226) 

Human transforming growth factor-beta induced gene 
product (BIGH3) (seq id nos 227-232) 

Human transcription factor ETV1 gene (seq id nos 233-239) 
Human insulin-like growth factor binding protein (IGFBP4) 
gene (seq id nos 240-246) 
Human MUCl gene (seq id nos 247-266) 

Human protein- tyrosine kinase (JAK1) gene (seq id nos 
267-271) 
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Human protein-tyrosine kinase (JAK3) gene (seq id nos 
272-279) 

Human Fit 4 gene (for transmembrane tyrosinase kinase) 
(seq id nos 280-284) 

Human p53 associated gene (seq id nos 285-292) 
Human can (hCAN) gene (seq id nos 298-300) 
Human DBL (hDBL) proto-oncogene / Human MCF2PO (hMCF2PO) 
gene (seq id nos 301-306) 

Human dek (hDEK) gene (seq id nos 307-309) 
Human retinoblastoma related protein (pl07) gene (seq id 
nos 310-313) 

Human G protein-coupled receptor (hGPRl) gene (seq id nos 
314-319) 

Human putative RNA binding protein (HRBP56) gene (seq id 
15 nos 320-325) 

Human transcription factor (hITF-2) gene (seq id nos 
326-327) 

Human malignant melanoma metastasis-supressor (hKiSS-1) 
gene (seq id nos 328-334) 

Human telomerase-associated protein TP-1 (hTP-1) gene 
(seq id nos 335-348) 

Human FDF-5 (hFDF-5) gene (seq id nos 349-356) 

Human metastasis-assosiated mtal (hMTAl) gene (seq id nos 
357-362) 

Human transcription factor TFIIB 90 kDa subunit 
(hTFIIB90) gene (seq id nos 363-369) 

Human tumour suppressor (hLUCA-1) gene (seq id nos 
370-377) 

Human Wilm's tumour (WIT-1) associated protein (seq id 
nos 388-393) 

Human cysteine protease (ICErel-III) gene (seq id nos 
394-398) 

Human Fas ligand (FasL) gene (seq id nos 399-403) 
Human BRCAl-associated RING domain protein (BARD1) gene 
(seq id nos 404-417) 

Human mcf.2 (hMCF.2) gene (seq id nos 418-422) 
Human Fas antigen (fas) gene (seq id nos 423-427) 
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Human DPC4 gene (seq id nos 429-437) . 

The mutant peptides that are the results of frameshift 
mutation in these genes, in accordance with the present 
5 invention, are listed in table 9. 





Table 9 










seq 


id 


no 


22; 


TVGRPHISC 




seq 


id 


no 


23; 


KTVGRPHISC 


10 


seq 


id 


no 


24; 


KQWEDPTS PANVIALLQT 




seq 


id 


no 


25; 


QWEDPTS PANVIALLQT 




seq 


id 


no 


26; 


QKT I KS T RKKTVGRPH ISC 




seq 


id 


no 


27; 


QKT I KS TRKKKT VGRPH ISC 




seq 


id 


no 


28, 


QKTIKSTRKKKQWEDPTSPANVIALLQT 


15 


seq 


id 


no 


29, 


■ QKTIKSTRKKQWEDPTSPANVIALLQT 




seq 


id 


no 


30, 


■ AADLQQQFVHFLDCWDVSSIPFTLHLPQAQDITT 




seq 


id 


no 


31, 


; GKDAKEKSS 




seq 


id 


no 


32, 


! GKDAKEKKS S 




seq 


id 


no 


33, 


r GKDAKEKKAADLQQQFVH FLDCWDVS S I PFTLHLPQAQDI TT 


20 


seq 


id 


no 


34, 


GKDAKEKAADLQQQFVH FLDCWDVS SIP FTLHLPQAQD ITT 




seq 


id 


no 


35, 


r FSMKQTLMNVKNLKTK 




seq 


id 


no 


36, 


KFSMKQTLMNVKNLKTK 




seq 


id 


no 


37, 


; VRT SKTRKKFSMKQTLMNVKNLKTK 




seq 


id 


no 


38, 


; VRT S KTRKKKFSMKQTLMNVKNLKTK 


25 


seq 


id 


no 


39, 


; VRT SKTRKKNFP 




seq 


id 


no 


40 


; VRTSKTRKNFP 




seq 


id 


no 


41 


; IKKKLLQFQK 




seq 


id 


no 


42 


; KIKKKLLQFQK 




seq 


id 


no 


43 


; KSRRNYFNFKNNCQSRL 


30 


seq 


id 


no 


44 


; SRRNYFNFKNNCQSRL 




seq 


id 


no 


45 


; TNLRVIQK IKKKLLQFQK 




seq 


id 


no 


46 


; TNLRVIQKKIKKKLLQFQK 




seq 


id 


no 


47 


; TNLRVIQKKSRRNYFNFKNNCQSRL 




seq 


id 


no 


48 


; TNLRVIQKSRRNYFNFKNNCQSRL 


35 


seq 


id 


no 


49 


; KIMIT 




seq 


id 


no 


50 


; NIDKIPEKIMIT 




seq 


id 


no 


51 


; NIDKIPEKKIMIT 
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seq id no 52; IINAN 

seq id no 53; KIINAN 

seq id no 54; NDKTVSEKI INAN 

seq id no 55; NDKTVSEKKI INAN 

seq id no 56; NGLEKEYLMVNQKE 

seq id no 57; S QT S LLEAKNGLEKE YLMVNQKE 

seq id no 58; SQTSLLEAKKNGLEKE YLMVNQKE 

seq id no 59; SQTSLLEAKKMA 

seq id no 60; SQTSLLEAKMA 

seq id no 61; TLVFPK 

seq id no 62; KTLVFPK 

seq id no 63; LKNVEDQKTLVFPK 

seq id no 64; LKNVEDQKKTLVFPK 

seq id no 65; LKNVEDQKKH 

seq id no 66; LKNVEDQKH 

seq id no 67; KKIQLY 

seq id no 68; KKKIQLY 

seq id no 69; RKR FS YTE YLAS 1 1 RFI FS VNRRKE I QNLS S CNFKI 
seq id no 70; LRIVSYSKKKKIQLY 
seq id no 71; LRIVSYSKKKKKIQLY 

seq id no 72; LRIVSYSKKRKRFSYTEYLASIIRFIFSVNRRKEIQNLS- 

-SCNFKI 

seq id no 73; LRIVSYSKRKRFSYTEYLASIIRFIFSVNRRKEIQNLS- 

-S CNFKI 

seq id no 74; QDLPLSSICQTIVTIYWQ 

seq id no 75; KQDLPLSSICQTIVTIYWQ 

seq id no 76; NRTCPFRLFVRRMLQFTGNKVLDRP 

seq id no 77; GFWSWKKQDLPLSSICQTIVTIYWQ 

seq id no 78; GFWSWKKKQDLPLSSICQTIVTIYWQ 

seq id no 79; GFVVSVVKKNRTCPFRLFVRRMLQFTGNKVLDRP 

seq id no 80; GFWSWKNRTCPFRLFVRRMLQFTGNKVLDRP 

seq id no 81; YRKTKNQN 

seq id no 82; KYRKTKNQN 

seq id no 83; NTERPKIRTN 

seq id no 84; DETFYKGKKYRKTKNQN 

seq id no 85; DET FYKGKKKYRKTKNQN 



15 



seq 


id 


no 


ft /" - 

8 6; 


seq 


id 


no 


87 ; 


seq 


id 


no 


ft o . 

88 ; 


seq 


id 


no 


89; 


seq 


id 


no 


90; 


seq 


id 


no 


91 ; 


seq 


id 


no 


92 ; 


seq 


id 


no 


93 ; 


seq 


id 


no 


94; 


seq 


id 


no 


95; 


seq 


id 


no 


96; 


seq 


id 


no 


97; 


seq 


id 


no 


98; 


seq 


id 


no 


99; 


seq 


id 


no 


100; 


seq 


id 


no 


101; 


seq 


id 


no 


102; 


seq 


id 


no 


103; 


seq 


id 


no 


104; 


seq 


id 


no 


105; 


seq 


id 


no 


106; 


seq 


id 


no 


107; 


seq 


id 


no 


108; 


seq 


id 


no 


109; 


seq 


id 


no 


110; 


seq 


id 


no 


ill; 


seq 


id 


no 


112; 


seq 


id 


no 


113; 


seq 


id 


no 


114; 


seq 


id 


no 


115; 


seq 


id 


no 


116; 


seq 


id 


no 


117; 
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DETFYKGKKNTERPKIRTN 
DET FYKGKNTERPK I RTN 
LSINNYRFQMKFYFRFTSHGSPFTSANF 
KLS INNYRFQMKFYFRFTSHGSPFTSANF 
NSVSTTTGFR 

NIQLAATKKLS INNYRFQMKFYFRFTSHGSPFTSANF 

NIQLAATKKKLS INNYRFQMKFYFRFTSHGSPFTSANF 

NIQLAATKKNSVSTTTGFR 

NIQLAATKNSVSTTTGFR 

MEHVAPGRMSAS PQS PTQ 

KMEHVAPGRMSAS PQS PTQ 

KWS TWLQAECQHLHS PQRS DKPQQAGLDQQHHC FALDS - 
-S PGPRPVFLQLLGLMGQGRHD 

WS TWLQAECQHLHS PQRS DKPQQAGLDQQHHC FALDS - 

-S PGPRPVFLQLLGLMGQGRHD 

TFS VWAE KMEHVAPGRMSAS PQSPTQ 

TFSVWAEKKMEHVAPGRMSASPQSPTQ 

TFSVWAEKKWSTWLQAECQHLHSPQRSDKPQQAGLDQ- 

-QHHCFALDSSPGPRPVFLQLLGLMGQGRHD 

TFSVWAEKWSTWLQAECQHLHSPQRSDKPQQAGLDQ- 

- QHHC FALDS S PGPRPVFLQLLGLMGQGRHD 

HKWLKFCLLRLVKES FHE 

KHKWLKFCLLRLVKES FHE 

KGGKAKGKKHKWLKFCLLRLVKES FHE 

KGGKAKGKKKHKWLKFCLLRLVKES FHE 

KGGKAKGKKNTNG 

KGGKAKGKNTNG 

VNNFFKKL 

KVNNFFKKL 

LS QGNVKKVNNFFKKL 

LSQGNVKKKVNNFFKKL 

GEKNDLQLFVMS DRRYKI YWTVI LLNPCGNLHLKTTS L 

KGEKNDLQLFVMSDRRYKIYWTVILLNPCGNLHLKTTSL 

KGKKMICSYS 

GKKMICSYS 

SSKTFEKKGEKNDLQLFVMSDRRYKIYWTVILLNPCGN- 
-LHLKTTSL 
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seq id no 118; S S KT FEKKKGEKNDLQL FVMS DRR YKI YWT VI LLNPCGN- 

-LHLKTTSL 

S S KT FEKKKGKKM I CS YS 
S S KT FEKKGKKMI CS YS 
QRKPKRANC VI QRRAKM 
KQRKPKRANC VI QRRAKM 
NKENQKEQTALLYRGGQRCRCVCLRF 
NKENQKEQTALLYRGGQRCRCVCLRF 
PDYQPPAKKQRKPKRANCVI QRRAKM 
PDYQPPAKKKQRKPKRANCVI QRRAKM 
PDYQPPAKKNKENQKEQTALLYRGGQRCRCVCLRF 
P D YQP PAKNKENQKE QTALL YRGGQRCRCVCLRF 
NLSSLLI 
TCLPF 

QPTFTLRKNLSSLLI 
QPT FTLRKKNLS S LL I 
QPTFTLRKKTCLPF 
QPTFTLRKTCLPF 

RATFLLSLWECSLPQARLCLIVSRTGLLVQS 
GQHFYWHCGSAACHRRGCV 

KENVRDKKRATFLLSLWECSLPQARLCLIVSRTGLLVQS 
KENVRDKKKRATFLLSLWECSLPQARLCLIVSRTGLLVQS 
KENVRDKKKGQHFYWHCGSAACHRRGCV 
KENVRDKKGQHFYWHCGSAACHRRGCV 
ITHTRWGITTWDSWSVRMKANWIQAQQNKSLILSPSFTK 
KITHTRWGITTWDSWSVRMKANWIQAQQNKSLILSPSFTK 
KLLTPGGELPHGILGQ 
LLTPGGELPHGILGQ 

PPVCELEKI THTRWG ITTWDSWSVRMKANW I QAQQNKS - 
-LILSPSFTK 

seq id no 145; PPVCELEKKI THTRWG I TTWDSWSVRMKANWI QAQQNKS - 

-LILSPSFTK 
seq id no 14 6; PPVCELEKKLLTPGGELPHGILGQ 
seq id no 147; PPVCELEKLLTPGGELPHGILGQ 
seq id no 14 8; SLKDELEKMKI 
seq id no 149; SLKDELEKKMKI 



seq id no lis 
seq id no 120 
seq id no 121 
seq id no 122 
seq id no 123 
seq id no 123 
seq id no 124 
seq id no 125 
seq id no 126 
seq id no 127 
seq id no 128 
seq id no 129 
seq id no 130 
seq id no 131 
seq id no 132 
seq id no 133 
seq id no 134 
seq id no 135 
seq id no 136 
seq id no 137 
seq id no 138 
seq id no 139 
seq id no 140 
seq id no 141 
seq id no 142 
seq id no 143 
seq id no 144 
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seq 


id no 


150; 


LGQSSPEKKNKN 


seq 


id no 


151; 


LGQSSPEKNKN 


seq 


id no 


152; 


RLRRINGRGSQIRSRNAFNRSEE 


seq 


id no 


153; 


EPKVKEEKKT 


seq 


id no 


154; 


EPKVKEEKKKT 


seq 


id no 


155; 


EPKVKEEKKRLRRINGRGSQIRSRNAFNRSEE 


seq 


id no 


156; 


EPKVKEEKRLRRINGRGSQIRSRNAFNRSEE 


seq 


id no 


157; 


T FRYKGKQHP FFS T 


seq 


id no 


158; 


GPNAPEEKNH 


seq 


id no 


159; 


GPNAPEEKKNH 


seq 


id no 


160; 


GPNAPEEKKTFRYKGKQHPFFST 


seq 


id no 


161; 


GPNAPEEKTFRYKGKQHPFFST 


seq 


id no 


162; 


MQNTCV 


seq 


id no 


163; 


KMQNTCV 


seq 


id no 


164; 


KCKIRVFSK 


seq 


id no 


165, 


• CKIRVFSK 


seq 


id no 


166, 


r FFKRT VQKMQNT CV 


seq 


id no 


167, 


; FFKRTVQKKMQNTCV 


seq 


id no 


168 


; FFKRTVQKKCKIRVFSK 


seq 


id no 


169 


; FFKRTVQKCKIRVFSK 


seq 


id no 


170 


; LPHYLAH 


seq 


id no 


171 


; CLITWLTN 


seq 


id no 


172 


; GSTTGLSATPLPHYLAH 


seq 


id no 


173 


; GSTTGLSATPPLPiixliArl 


seq 


id no 


174 


; GSTTGLSATPPCLITWlilW 


seq 


id no 


175 


; GS TTGLSATPCLI TWLTN 


seq 


id no 


176 


; RFADKPRPN 


seq 


id no 


177 


; DLPTSPDQTRSGPVHVSVEP 


seq 


id no 


178 


; DS AAGC S GT PRFADKPRPN 


seq 


id no 


179 


; DSAAGCSGTPPRFADKPRPN 


seq 


id no 


180 


; DSAAGCSGTPPDLPTSPDQTRSGPVHVSVEP 


seq 


id no 


181 


; DSAAGCSGTPDLPTSPDQTRSGPVHVSVEP 


seq 


id no 


182 


; AHPETPAQNRLRIPCSRREVRSRACKPPGAQGSDER- 
-RGKASPGRDCDVRTGRP 


seq 


id no 


183 ; PAHPETPAQNRLRIPCSRREVRSRACKPPGAQGSDER 



-RGKASPGRDCDVRTGRP 
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seq id no 184; RPTRRHPRR IAS GS PAVGGR 

seq id no 185; VAIRGHPRPPAHPETPAQNRLRIPCSRREVRSRACKP- 

- PGAQGSDERRGKASPGRDCDVRTGRP 
seq id no 186; VAIRGHPRPPPAHPETPAQNRLRIPCSRREVRSRACKP- 

- PGAQGSDERRGKASPGRDCDVRTGRP 
seq id no 187; VAIRGHPRPPRPTRRHPRR IAS GS PAVGGR 
seq id no 188; VAIRGHPRPRPTRRHPRRI AS GSPAVGGR 
seq id no 189; RGRTS GRS LS CCRRPRCRPAVASRS TAP S PRAGS R- 

-RCCLRTSCGAARPRRTRSACGDWVASPPTRSS- 
-SRTACGAASPPARSWSAP 
seq id no 190; GGGHLEEV 

seq id no 191; YFGGPDSTPRGRTSGRSLSCCRRPRCRPAVASR- 

-STAPSPRAGSRRCCLRTSCGAARPRRTRSACGD- 
- WVAS PPTRS S SRTACGAAS PPARS WSAP 

seq id no 192; YFGGPDSTPPRGRTSGRSLSCCRRPRCRPAVASR- 

-STAPSPRAGSRRCCLRTSCGAARPRRTRSACGDW- 
-VASPPTRSSSRTACGAASPPARSWSAP 

seq id no 193; YFGGPDSTPPGGGHLEEV 

seq id no 194; YFGGPDSTPGGGHLEEV 

seq id no 195; HRVADP 

seq id no 196; LSQSSELDPPSSR 

seq id no 197; LSQSSELDPPPSSR 

seq id no 198; LSQSSELDPPHRVADP 

seq id no 199; LSQSSELDPHRVADP 

seq id no 200; VILLPEDTPPS 

seq id no 201; VILLPEDTPPPS 

seq id no 202; VILLPEDTPPLLRA 

seq id no 203; VILLPELDPLLRA 

seq id no 204; PSPLP 

seq id no 205; PLLFHRPCSPSPALGATVLAVYRYE 
seq id no 206; LLFHRPCS PS PALGATVLAVYRYE 
seq id no 207; APRPPLGPPSPLP 
seq id no 208; APRPPLGPPPSPLP 

seq id no 209; APRPPLGPPPLLFHRPCS PS PALGATVLAVYRYE 
seq id no 210; APRPPLGPPLLFHRPCSPSPALGATVLAVYRYE 
seq id no 211; TQVLPQGCSLSLLHTTFPHRQVPHILDW 
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seq id no 212; PTQVLPQGCSLSLLHTTFPHRQVPHILDW 

seq id no 213; PLQSFPKDAASAFSTPRFPTDKFPTSWTGSCPGQPHGT- 

-RAFCQPGPE FNAFSAC 
seq id no 214; LQSFPKDAASAFSTPRFPTDKFPTSWTGSCPGQPHGT- 

-RAFCQPGPEFNAFSAC 
seq id no 215; PSPRPQSQPPTQVLPQGCSLSLLHTTFPHRQVPHILDW 
seq id no 216; PSPRPQSQPPPTQVLPQGCSLSLLHTTFPHRQVPHILDW 
seq id no 217; PSPRPQSQPPPLQSFPKDAASAFSTPRFPTDKFPTS- 

-WTGSCPGQPHGTRAFCQPGPEFNAFSAC 
seq id no 218; PSPRPQSQPPLQSFPKDAASAFSTPRFPTDKFPTS- 

-WTGSCPGQPHGTRAFCQPGPEFNAFSAC 
seq id no 219; TAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 220; PTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 221; PRPGPAGGALLPRSLTAFVPHSGHGLPVSSGEPAYTPIP- 

- HDVPHGT P P FC 

seq id no 222; RPGPAGGALLPRSLTAFVPHSGHGLPVSSGEPAYTPIPH- 

-DVPHGTPPFC 

seq id no 223; DLPAVPGPPTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 224; DLPAVPGPPPTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 225; DLPAVPGPPPRPGPAGGALLPRSLTAFVPHSGHGLPVSSG- 

-EPAYTPIPHDVPHGTPPFC 
seq id no 226; DLPAVPGPPRPGPAGGALLPRSLTAFVPHSGHGLPVSSG- 

- E PAYT P I PHDVPHGT P P FC 
seq id no 227; QWGLSWMS 
seq id no 228; NGDCHGCPEGRQSL 
seq id no 229; FTMDRVLTPQWGLSWMS 
seq id no 230; FTMDRVLTPPQWGLSWMS 
seq id no 231; FTMDRVLT P PNGDCHGC PEGRQS L 
seq id no 232; FTMDRVLTPNGDCHGCPEGRQSL 
seq id no 233; HHPARQCPHCIMHLQTQLIHRNLTGPSQLTSLHRS- 

- PYQIAAT PWTTD FAAS FFLNPVT P FLLCRRCQGKDV- 

-LCTNARCLSQTSPSHHKALSRTTTQCMNT- 

- TPWLAVRPAKAFPLL 
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seq id no 234; PHHPARQCPHCIMHLQTQLIHRNLTGPSQLTSLHRS- 

-PYQIAATPWTTDFAASFFLNPVTPFLLCRRCQGK- 

-DVLCTNARCLSQTSPSHHKALSRTTTQCMNTTP- 

-WLAVRPAKAFPLL 
seq id no 235; HTIQHASVPTASCISKLNSYTEN 
seq id no 236; PQVGMRPSNPPHHPARQCPHCIMHLQTQLIHRNLT- 

-GPSQLTSLHRSPYQIAATPWTTDFAASFFLNPVTPFL- 

-LCRRCQGKDVLCTNARCLSQTSPSHHKALSRTTTQC- 
-MNTTPWLAVRPAKAFPLL 
seq id no 237; PQVGMRPSNP PPHHPARQCPHCIMHLQTQLIHRNLTGPS 

- QLT S LHRS P YQ I AATPWT TDFAAS FFLNPVT P FLLCRRC 

-QGKDVLCTNARCLSQTSPSHHKALSRTTTQCMNTTPWLA 
-VRPAKAFPLL 

seq id no 238; PQVGMRP SNPPHT I QHAS VPTASC I SKLNS YTEN 

seq id no 239; PQVGMRPSNPHTIQHASVPTASCISKLNSYTEN 

seq id no 24 0; WAARSWCERRAAAVAPLAPWAWGCPAGCTPPVAARAC- 

-AATRPEGWRSPCTH 
seq id no 241; PWAARSWCERRAAAVAPLAPWAWGCPAGCTPPVAA- 

-RACAATRPEGWRSPCTH 
seq id no 242; RGLRGAGARGGLRLLRHLRPGLGDALRGVHPPLR- 

-LGPALLPAPRGGEAPAHTDARARRVHGAGGDRGHPGPAAL 
seq id no 243; EEKLARCRPPWAARSWCERRAAAVAPLAPWAWGCPAGC- 

- TP PVAARACAATRPEGWRS PCTH 
seq id no 244; EEKLARCRPP PWAARSWCERRAAAVAPLAPWAWGCPA- 

-GCTP PVAARACAATRPEGWRS PCTH 
seq id no 245; EEKLARCRPPRGLRGAGARGGLRLLRHLRPGLGDA- 

-LRGVHPPLRLGPALLPAPRGGEAPAHTDARARRVHGAGG- 
-DRGHPGPAAL 

seq id no 246; EEKLARCRPRGLRGAGARGGLRLLRHLRPGLGDALRG- 

-VHPPLRLGPALLPAPRGGEAPAHTDARARRVHGAGG- 
-DRGHPGPAAL 

seq id no 247; QPPVSPRPRRPGRPRAPPPPQPMVSPRRRTTGPPW- 

-RPPPLQSTMSPPPQALHQAQLLLWCTTAPLPGLPQPQ- 

- PARALHSQFPATTLILLPPLPAIAPRLMPVALT IARYL- 

-LSPPPITALLPSCLLGSLSFSCLFTFQTSSLIPLW- 
-KIPAPTTTKSCRETFLKW 
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seq id no 248; 



seq id no 249; 



seq id no 250; 



seq id no 251; 



seq id no 252, 



seq id no 253, 



seq id no 254; 



seq id 
seq id 
seq id 
seq id 
seq id 
seq id 
seq id 
seq id 
seq id 
seq id 
seq id 



no 255 
no 256 
no 257 
no 258 
no 259 
no 260 
no 2 61 
no 262 
no 263 
no 264 
no 265 



SPGCHLGPGDQAAPGLHRPPSPWCHLGAGQQARLGVHR- 
-PSSPQCHLGLRLCIRLSFYSGAQRHLCQGYHNPSQQEHS- 

-ILNSQPPL 

KPAPGSTAPQPPVSPRPRRPGRPRAPPPPQPMVSPRR- 
-RTTGPPWRPPPLQSTMSPPPQALHQAQLLLWCTTAP- 
-LPGLPQPQPARALHSQFPATTLILLPPLPAIAPRLMPVA- 
-LTIARYLLSPPPITALLPSCLLGSLSFSCLFTFQTS- 

- SL I PLWKI PAPTTTKS CRET FLKW 

KPAPGSTAPPQPPVSPRPRRPGRPRAPPPPQPMVSPR- 

-RRTTGPPWRPPPLQSTMSPPPQALHQAQLLLWCT- 

-TAPLPGLPQPQPARALHSQFPATTLILLPPLPAIAP- 

- RLMPVALT I ARYLL S P P P I TALLP SCLLGSLSFS CL F- 

-TFQTSSLIPLWKIPAPTTTKSCRETFLKW 

KPAPGS TAP PS PGCHLGPGDQAAPGLHRPPS PWCHL- 

-GAGQQARLGVHRPS S PQCHLGLRLC I RLS FYSGA- 1 - 

-QRHLCQGYHNPSQQEHS ILNSQPPL 

KPAPGS TAPS PGCHLGPGDQAAPGLHRPPS PWCHL- 

-GAGQQARLGVHRPS S PQCHLGLRLC I RLS FYS GAQ- 

-RHLCQGYHNPSQQEHS ILNSQPPL 

QPMVS PRRRT T GP PWRPP PLQS TMS PP PQALHQAQL- 

-LLWCTTAPLPGLPQPQPARALHSQFPATTLILLPPLP- 

- AI APRLMPVALT I ARYLLS P PP I TALLP S CLLGS L- 
- S FS CL FT FQT S S L I PLWKI PAP T T TKS CRET FLKW 
S PWGHLGAGQQARLGVHRP S S PQCHLGLRLC IRLS F- 
-YSGAQRHLCQGYHNPSQQEHS ILNSQPPL 

RPPPGSTAPQPMVSPRRR 
RPPPGSTAPPQPMVSPRRR 
RPPPGSTAPPSPWCHLGA 
RPPPGS TAPS PWCHLGA 
RPRAPPPPSPWCHL 
RPRAPPPPPSPWC 
RPRAPPPPAHGVTSAP 
RPRAPPPPPAHGV 
APGLHRPPQPMVS P 
AAPGLHRPQPMVSPR 
PGLHRP P PAHGVT 
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seq id no 266; APGLHRPPAHGVTS 

seq id no 2 67; VDRPQHTEWLSWSNLYRIRHQ 

seq id no 2 68; HYLCTDVAPR 

seq id no 2 69; H YLC TDVAP PR 

seq id no 270; HYLCTDVAPPVDRPQHTEWLSWSNLYRIRHQ 
seq id no 271; HYLCTDVAP VDRPQHTEWLSWSNLYRIRHQ 
seq id no 272; SAYLSPLGTTWLRTCACRLPRPAASCLCTTPSLLW- 

-PRRTCPAGSPRATSSPWRMPAPKSCCTTGLAFTS- 
-PIGLGWRSATASGYARIWPVLSLTCQSWSTSLPSTAVTW 
seq id no 273; PSAYLSPLGTTWLRTCACRLPRPAASCLCTTPSLLWP- 

-RRTCPAGSPRATSSPWRMPAPKSCCTTGLAFTSP- 

-IGLGWRSATASGYARIWPVLSLTCQSWSTSLPSTAVTW 
seq id no 274; PAPI FLLWGPLG 

seq id no 275; AP I FLLWGPLG 

seq id no 276; LPARAPGPPSAYLSPLGTTWLRTGACRLPRPAASCL- 

-CTTPSLLWPRRTCPAGSPRATSSPWRMPAPKSCC- 
-TTGLAFTSPIGLGWRSATASGYARIWPVLSLT- 
-CQSWSTSLPSTAVTW 
seq id no 277; LPARAPGPPPSAYLS PLGTTWLRTCACRLPRPAAS - 

-CLCTTPSLLWPRRTCPAGSPRATSSPWRMPAPKSCC- 
-TTGLAFTSPIGLGWRSATASGYARIWPVLSLTC- 
-QSWSTSLPSTAVTW 
seq id no 278; LPARAPGPPPAP I FLLWGPLG 
seq id no 279; LPARAPGPPAPI FLLWGPLG 
seq id no 280; DLEHHGGVTRHRHR 
seq id no 281; LVSDYSMTPRP 
seq id no 282; LVSDYSMTPPRP 
seq id no 283; LVSDYSMTPPDLEHHGGVTRHRHR 
seq id no 284; LVS DYSMT PDLEHHGGVTRHRHR 
seq id no 285; FHHIATDVGPFVRIGFLKIKGKIKGKSLRKPNW- 

-KTQHKLKRALMFL I VKKL 
seq id no 286; P FHH I ATDVG P FVR I G FLK I KGK I KGKS LRKPNWK - 

- TQHKLKRALMFL I VKKL 
seq id no 287; PS I TLQQMLAPS 
seq id no 298; SI TLQQMLAPS 
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seq id no 289; TSCNEMNPPFHHIATDVGPFVRIGFLKIKGKIKGKSL- 

-RKPNWKTQHKLKRALMFL IVKKL 
seq id no 290; TSCNEMNPPPFHHIATDVGPFVRIGFLKIKGKIKG- 

-KSLRKPNWKTQHKLKRALMFLIVKKL 
seq id no 291; TSCNEMNPPS ITLQQMLAPS 
seq id no 292; TSCNEMNPPPS ITLQQMLAPS 
seq id no 293; LEMILFLMTF 
seq id no 294; HPC I TKT FLEMI L FLMT F 
seq id no 295; HPCITKTFFLEMILFLMTF 
seq id no 296; HPCITKTFFWR 
seq id no 297; HPCITKTFWR 
seq id no 298; LMFEHSQMRLNSKNAHLPIISF 
seq id no 299; EYGS I IAFLMFEHSQMRLNSKNAHLP 1 1 S F 
seq id no 300; EYGSIIAFFLMFEHSQMRLNSKNAHLPIISF 

seq id no 301; HLNKGRRLGDKI RAT 

seq id no 302; FHLNKGRRLGDKIRAT 

seq id no 303; VTSGTPFFHLNKGRRLGDKIRAT 

seq id no 304; VTSGTPFFFHLNKGRRLGDKIRAT 

seq id no 305; VTSGTPFFFI 

seq id no 306; VTSGTPFFI 

seq id no 307; CEIERIHFFF 

seq id no 308; CEIERIHFFSK 

seq id no 309; CEIERIHFSK 

seq id no 310; FRYISKSI 

seq id no 311; RYISKSI 

seq id no 312; FKKYEPIFFRYISKSI 

seq id no 313; FKKYEPI FRYISKSI 

seq id no 314; FPDSDQPGPLYPLDPSCL I S S ASNPQELSDCHYIH- 

-LAFGFSNWRSCPVLPGHCGVQ 
seq id no 315; PDSDQPGPLYPLDPSCLISSASNPQELSDCHYIHL- 

-AFGFSNWRSCPVLPGHCGVQ 
seq id no 316; LNMFASVFS 
seq id no 317; LNMFASVFFS 

seq id no 318; LNMFASVFFPDS DQPGPLYPLDPS CLI S SASNPQE- 

- LSDCHY I HLAFGFSNW RSCPVLPGHCGVQ 
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seq id no 319; LNMFASVFPDSDQPGPLYPLDPSCLISSASNPQELS- 

-DCHYIHLAFGFSNWRSCPVLPGHCGVQ 
seq id no 320; AME E TVWAVAT VE TE VEAME E T G WAAMEE TE VGAT - 

-EETEVAMEAKWEEETTTEMISATDHT 
seq id no 321; LWVRPWLWEWLRWRPKWRLWRRQEWWRLWRRPRWGL- 

-RRRPRWLWRENGRKKRLQK 
seq id no 322; YGGDRS RGAME E TVWAVAT VE TE VEAME E T G WAAM- 

- EE TE VGATEE TEVAMEAKWEEE T TTEMI S ATDHT 
seq id no 323; YGGDRS RGGAMEE TVWAVAT VE TEVEAMEE TGVVA- 

-AMEETEVGATEETEVAMEAKWEEETTTEMISATDHT 
seq id no 324; YGGDRSRGGLWVRPWLWEWLRWEPKWRLWRRQEWW- 

-RLWRRPRWGLRRRPRWLWRENGRKKRLQK 
seq id no 325; YGGDRSRGLWVRPWLWEWLRWEPKWRLWRRQEWWR- 

-LWRRPRWGLRRRPRWLWRENGRKKRLQK 
seq id no 32 6; EFGGGRRQK 

seq id no 327; EFGGRRQK 
seq id no 328; RRAKGGGAGASNPRQ 
seq id no 329; GRRAKGGGAGASNPRQ 
seq id no 33.0; DVGLREGALELPTRGNKRNVA 
seq id no 331; MRGGGGVGGRRAKGGGAGASNPRQ 
seq id no 332; MRGGGGVGGGRRAKGGGAGASNPRQ 
seq id no 333; MRGGGGVGGDVGLREGALELPTRGNKRNVA 
seq id no 334; MRGGGGVGDVGLREGALELPTRGNKRNVA 
seq id no 335; VWQLAGPMLAGWRSLGSWFCRMYGI 
seq id no 336; CGSWPALCWRAGGVWAVGSAGCMEYDPEALPAAWGP- 
-AAAATVHPRR 

seq id no 337; RRYPCEWGVWQLAGPMLAGWRSLGSWFCRMYGI 
seq id no 338; RRYPCEWGGVWQLAGPMLAGWRS LGSWFCRMYG I 
seq id no 339; RRYPCEWGGCGSWPALCWRAGGVWAVGSAGCMEYD- 

-EALPAAWGPAAAATVHPRR 
seq id no 340; RRYPCEWGCGSWPALCWRAGGVWAVGSAGCMEYDPE- 

-ALPAAWGPAAAATVHPRR 

seq id no 341; LWLWAGWTVWWSCGPGEKGHGWPSLPTMALLLLRFSCM- 
-RVASY 
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seq id no 342; 
seq id no 343; 

seq id no 344; 

seq id no 345; 
seq id no 346; 
seq id no 347; 

seq id no 348; 



seq 

seq 

seq 

seq 

seq 

seq 

seq 

seq 

seq 

seq 

seq 

seq 

seq 

seq 

seq 



id no 
id no 
id no 
id no 
id no 
id no 
id no 
id no 
id no 
id no 
id no 
id no 
id no 
id no 
id no 



349 

350 

351 

352 

353 

354 

355 

356 

357 

358 

359 

360 

361 

362 

363 



seq id no 364 



GLWLWAGWTVWWSCGPGEKGHGWPSLPTMALLLL- 
-RFSCMRVASY 

GCGCGPAGQYGGAVGLARRGTAGCLPCPPWLCCCCAF- 
-PACGLPGTDGWRGWQGSGCVRVSGSAPWAPGFPFSP- 

-PCPLCGTQPRW 

CGCGPAGQYGGAVGLARRGTAGCLPCPPWLCCCCAFPACG- 
-LPGTDGWRGWQGSGCVRVSGSAPWAPGFPFSPPC- 

-PLCGTQPRW 

LAFNVPGGLWLWAGWTVWWSCGPGEKGHGWPSLPTMA- 
-LLLLRFSCMRVASY 

LAFNVPGGGLWLWAGWTVWWSCGPGEKGHGWPSLPTM- 
- ALLLLRFS CMRVAS Y 

LAFNVPGGGCGCGPAGQYGGAVGLARRGTAGCLPCPP- 
-WLCCCCAFPACGLPGTDGWRGWQGSGCVRVSGSAPW- 

-APGFPFSPPCPLCGTQPRW 

LAFNVPGGCGCGPAGQYGGAVGLARRGTAGCLPCPPW- 
-LCCCCAFPACGLPGTDGWRGWQGSGCVRVSGSAPWA- 

-PGFPFSPPCPLCGTQPRW 

PPMPMPGQREAPGRQEA 

GPPMPMPGQREAPGRQEA 

GHQCQCQGKGRHRADRRPDTAQEE 

HQCQCQGKGRHRADRRPDTAQEE 

GGHSYGGGPPMPMPGQREAPGRQEA 

GGHSYGGGGPPMPMPGQREAPGRQEA 

GGHSYGGGGHQCQCQGKGRHRADRRPDTAQEE 

GGHSYGGGHQCQCQGKGRHRADRRPDTAQEE 

APCPQSSGGG 
LPAPSQAAADELDRRPG 
TKVRL I RGAPCPQS S GGG 
TKVRLIRGGAPCPQS SGGG 
TKVRLIRGGLPAPSQAAADELDRRPG 

TKVRLIRGLPAPSQAAADELDRRPG 
CSLAKDGSTEDTVSSLCGEEDTEDEELEAAASHLNK- 

-DLYRELLGG 

GCSLAKDGSTEDTVSSLCGEEDTEDEELEAAASHLNK- 
-DLYRELLGG 
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seq id no 365; AAAWQKMAP PRT PRPACVARR 

seq id no 366; ENSRPKRGGCS LAKDGS TEDTVSS LCGEEDTEDEELE- 

-AAASHLNKDLYRELLGG 
seq id no 367; ENSRPKRGGGCSLAKDGSTEDTVSSLCGEEDTEDE- 

-ELEAAASHLNKDLYRELLGG 
seq id no 3 68; ENSRPKRGGAAAWQKMAPPRTPRPACVARR 
seq id no 369; ENSRPKRGAAAWQKMAP PRT PRPACVARR 
seq id no 370; HCVLAASGAS 
seq id no 371; GHCVLAASGAS 

seq id no 372; GTASSRPLGLPKPHLHRPVPIRHPSCPK 
seq id no 373; TASSRPLGLPKPHLHRPVPIRHPSCPK 
seq id no 374; AGTLQLGGHCVLAASGAS 
seq id no 375; AGTLQLGGGHCVLAASGAS 

seq id no 376; AGTLQLGGGTASSRPLGLPKPHLHRPVPIRHPSCPK 
seq id no 377; AGTLQLGGTASSRPLGLPKPHLHRPVPIRHPSCPK 
seq id no 378; RRTPSTEKR 
seq id no 379; RRTPSTEKKR 
seq id no 380; RRT PS TEKKGRS EC 
seq id no 381; RRTPS TEKGRSEC 

seq id no 382; STTKCQSGTAETYNSWKVKNLQLEPRRVTSQMNRQVK- 
-DMTAILSQS 

seq id no 384; SSEEIKKKSTTKCQSGTAETYNSWKVKNLQLEPRRV- 

-TSQMNRQVKDMTAILSQS 
seq id no 385; SSEEIKKKKSTTKCQSGTAETYNSWKVKNLQLEPRR- 

-VTSQMNRQVKDMTAILSQS 
seq id no 386; S SEE IKKKKVQPNASQAQQKPTTHGR 
seq id no 387; SSEEIKKKVQPNASQAQQKPTTHGR 
seq id no 388; NRGWVGAGE 
seq id no 389; IEAG 
seq id no 390; VHNYCNMKNRGWVGAGE 
seq id no 391; VHNYCNMKKNRGWVGAGE 
seq id no 392; VHNYCNMKKIEAG 
seq id no 393; VHNYCNMKI EAG 
seq id no 394; QLRCWNTWAKMFFMVFLI I WQNTMF 
seq id no 395; VKKDNHKKQLRCWNTWAKMFFMVFL 1 1 WQNTMF 
seq id no 396; VKKDNHKKKQLRCWNTWAKMFFMVFLI I WQNTMF 
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seq id no 397; VKKDNHKKKNS 
seq id no 398; VKKDNHKKNS 

seq id no 399; GAEESGPFNRQVQLKVHASGMGRHLWNCPAFWSEV 
seq id no 400; HPSPPPEKRS 
seq id no 401; HPSPPPEKKRS 

seq id no 402; HPSPPPEKKGAEESGPFNRQVQLKVHASGMGRHLW- 

-NCPAFWSEV 

seq id no 403; HPSPPPEKGAEESGPFNRQVQLKVHASGMGRHLWN- 

-CPAFWSEV 

seq id no 404; MQVLSKTHMNLFPQVLLQMFLRGLKRLLQDLEKSKKRKL 
seq id no 405; RCKSARLI 

seq id no 406; VQTQPAIKKMQVLSKTHMNLFPQVLLQMFLRGLKRLLQ- 

-DLEKSKKRKL 

seq id no 407; VQTQPAIKKKMQVLSKTHMNLFPQVLLQMFLRGLIOIL- 
15 - LQDLEKS KKRKL 

seq id no 408; VQTQPAIKKRCKSARLI 
seq id no 409; VQTQPAIKRCKSARLI 
seq id no 410; ARS GKKQKRKL 
seq id no 411; ARS GKKQKKRKL 
20 seq id no 412; ARS GKKQKKENFS 
seq id no 413; ARS GKKQKENFS 
seq id no 414; KASARSGKSKKRKL 
seq id no 415; KASARSGKKS KKRKL 
seq id no 416; KAS ARS GKKAKKENS F 
25 seq id no 417; KASARSGKAKKENS F 
seq id no 418; HLNKGRRLGDKI RAT 
seq id no 419; VTSGTPFFHLNKGRRLGDKIRAT 
seq id no 420; VTSGTPFFFHLNKGRRLGDKIRAT 
seq id no 421; VTSGTPFFFI 
30 seq id no 422; VTSGTPFFI 

seq id no 423; VTLLYVNTVTLAPNVNMESSRNAHSPATPSAKRK- 

-DPDLTWGGFVFFFCQFH 
seq id no 424; KCRCKPNFFVTLLYVNTVTLAPNVNMESSRNAHSP- 

-ATPSAKRKDPDLTWGGFVFFFCQFH 
35 seq id no 425; KCRCKPNFFFVTLLYVNTVTLAPNVNMESSRNAH- 

-SPATPSAKRKDPDLTWGGFVFFFCQFH 
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seq id no 426; KCRCKPNFFL 

seq id no 427; KCRCKPNFL 

seq id no 429; LVKKLKEKKMNWIL 

seq id no 430; LVKKLKEKKKMNWIL 

seq id no 431; LVKKLKEKKR 

seq id no 432; LVKKLKEKR 

seq id no 433; AAIVKDCCR 

seq id no 434; S QPAS I LGRKL 

seq id no 435; SQPASILGKRKL 

seq id no 436; SQPAS ILGKAAIVKDCCR 

seq id no 437; SQPAS ILGAAIVKDCCR 



Examples of cancers particularly suitable for treatment 
with one or a combination of several of this compounds 
are: colorectal cancer, breast cancer, small-cell lung 
cancer, non small-cell lung cancer, liver cancer (primary 
and secondary) , renal cancer, melanoma, ovarian cancer 
cancer of the brain, head and neck cancer, pancreatic 
cancer, gastric cancer, eosophageal cancer, prostate 
cancer and leukemias and lymphomas. 

Below are listed some examples of where these mutations 
may result in gene products that result in development of 
tumours : 

Development of colorectal cancers are believed to result 
from a series of genetic alterations. Deleted in 
colorectal cancer (DCC) gene (seq id nos 30-34), Human 
putatxve mismatch repair /binding protein (hMSH3, gene 
(Seq id hos 134-147), Human hMSH6 gene (seq id nos 201-204 
and 295-299), Human n-myc gene (seq id nos 190-195), Human 
TGFP2 (hTGF{32) gene (seq id nos 22-29), Human p53 
associated gene (seq id nos 287-294) may be involved in 
colorectal cancer • 
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Human breast cancer susceptibility (BRCA2) »» 
35-94, and Human BRCAl-associated RING domain protein 

> gene (seq id nos 404-4X3, are involved x» breast 
^er and ovarian cancer Human hMS«6 gene (se^idnos 
201 -204 and 295-299) may be involved in brain tumours. 

Gene alteration are frequent in many types of 
adenocarcinomas , below are listed some genes that 
mutated in many cancers: 

Human breast cancer susceptibility (BRCA2, gene <se, id 
Human or ,„„,.•., i cancer (DCC, gene (seq id 

nos 35-94), Deleted in colorectal cancer i y 

nos 30-34), Human putatative mismatch 

protein (hMSH3, gene (seq id nos 134-147, , Human 

gene seq id nos 201-204 and 295-299, , human N-MYC gene 

seq id no 190-X9S), Human TG.b2 «~ "° S 

22-29) Human p53 associated gene (seq id nos 287 294), 
22-29), Huma p 248 . 2 67), Human germline n-myc 

:r,r q iHL' »»» ^ s «««, 

IssL a ed protein (seq id nos 388-393,, H,«an nasopharynx 
carcinoma EBV BNLF-1 gene «seq id nos 205-211), Human 
forming growth factor-beta inducted gene product 
(BIGH3) seq id nos 228-233) . 

Many of the mutated genes may result in development of 
"euLmias and lymphomas: Human neurof ibro»in (NF1) gene 
(se q id nos 178-183), b-raf oncogene (seq id nos 172 177,, 
Human protein-tyrosine Kinase ,»«> gene (seq id nos 
268-272), Human protein-tyrosine kinase ( JAK3) gene (seq 
id nos 273-280) are examples. 

Genes involved in malignant melanoma: Human 
melanoma metastasis-supressor (hKiSS-l) gene (seq id 
331-337), Genes involved in metastasis: Human 
metastasis-assosiated mtal (hMTAl) gene (seq id nos 

360-365) . 
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Cell cycle control and signal transduction is strikcly 
regulated. Frameshift mutations in these genes may result 
in uncontrolled cell growth. Examples of genes which may 
be suseptable are: Human protein tyrosine phosphatase 
(hPTP) gene (seq id nos 95-102), Human kinase (TTK) gene 
(seq id nos 109-121), Human transcriptional repressor 
(CTCF) gene (seq id nos 122-128), Human cell cycle 
regulatory protein (ElA-binding protein) p300 gene (seq id 
nos 212-219), Human tranforming growth factor-beta 
inducted gene product (BIGH3) (seq id nos 228-233), Human 
FLt4 gene (for transmembrane tyrosinase kinase (seq id nos 
281-286), Human G protein-coupled receptor (hGPRl) gene 
(seq id nos 317-322), Human transcription factor (hITF-2) 
gene (seq id nos 329-330), Human telomerase-associated 
protein TP-1 (hTP-1) gene (seq id nos 338-351), Human 
transcription factor TFIIB 90 kDa subunit (hTFBIIB90) gene 
(seq id nos 366-373), Human FADD-homologous ICE/CED-31ike 
protease gene (seq id nos 129-133) 

Mutations in DNA synthesis or -repair enzymes may also 
lead to uncontrolled cell growth. Human DNA topoisomerase 
II (top2) gene (seq id nos 103-108) and Human putative 
mismatch repair/binding protein (hMSH3) gene (seq id nos 
134-147) and (hMSH6) gene (seq id nos 201-204 and 
205-299) . 



The following are tumour suppressor genes, Human 
retinoblastoma binding protein 1 isoform I (hRBPl) gene 
(seq id hos 148-158), Human neurof ibromin (NF1 ) gene (seq 
id nos 178-183) , Human p53 associated gene (seq id nos 
287-294), Human retinoblastoma related protein (pl07) gene 
(seq id nos 312-316), Human tumour suppressor (hLUCA-1) 
gene (seq id nos 374-381), Mutations in these genes may 
result in development of cancer. 
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The following are oncogenes, proto-oncogenes or putative 
oncogenes ; Human germline n-myc gene (seq id nos 
184-189), Human n-myc gene (seq id nos 190-195), Human can 
(hCAN) gene (seq id nos 300-302), Human dek (hDEK) gene 
(seq id nos 309-311), b-raf oncogene (seq id nos 172-177), 
Human DEL (hDBL) proto-oncogene / Human MCF2PO (hMCF2P0) 
gene (seq id nos 303-308) . Frameshift mutations in these 
genes may lead to development of cancer. 



RTOT.QGICA T. F.XPKRIMENTS 

riPfirriptinn of th ^ Figures 
FIG. 1; 

It has been demonstrated that T cells from normal donors 
can be stimulated with a mixture of peptides containing 
both mutant BAX and mutant TGFpRII peptides. Peptide 
mixture dependent T cell proliferation in blood samples 
from six different donors are shown in figure 1. The 
results were obtained by stimulating peripheral blood 
mononuclear cells (PBMCs) from each donor with a mixture of 
mutant BAX peptides (seq id nos 1,9-12) and mutant TGFpRII 
peptides (seq id nos 15-21). The concentration of each 
individual peptide in the mixture was 20 uM. After two 
weeks, and weekly thereafter, the bulk cultures were 
restimulated with autologous PBMCs pulsed with 10-25 uM of 
the peptide mixture. After 4-5 restimulations the bulk 
cultures were tested in a standard proliferation assay with 
PBMCs alone or as a control or PBMCs pulsed with 25 MM of 
the peptides as antigen presenting cells (APCs) . 

FIG. 2: 

It has further been found that T cell clones can be 
generated against separate peptides of the mixture used in 
the bulk stimulation experiments. Figure 2 shows the 
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proliferation of T cell clone 521-2 which was obtained by 
cloning the bulk culture from donor 1 (figure 1) by seeding 
5 cells per well in U-bottomed, 96-well microtiter plates 
and using autologous PBMCs pulsed with 25 uM of the mutant 
BAX peptide with seq id no 12 as feeder cells. Autologous 
B-lymphoblastoid cells were used as APCs in the 
proliferation assay. 

FIG. 3: 



In figure three it is shown that mutant BAX peptides and 
mutant TGF0RII peptides can be used to stimulate T cells 
(PBMCs) from a patient with breast cancer. Dendritic cells 
(DCs) from the same cancer patient were used as APCs. The T 
cell stimulation (figure 3) was obtained by pulsing DCs 
separately with a mixture of mutant BAX peptides (seq id 
nos 1,9-12) and a mixture of mutant TGF0RII peptides (seq 
id nos 15-21) followed by addition of autologous PBMCs and 
10 ng/ml tumour necrosis factor. The concentration of each 
peptide in the mixtures used for pulsing was 25 uM. The 
PBMCs and the DCs were obtained by leukapheresis from a 
patient with breast cancer who had been on a granulocyte 
colony stimulating factor (G-CSF) treatment. The CD34+ 
cells were isolated from the cell product before DCs were 
derived using standard methods. 

FIG. 4; 

Figure 4 shows the capability of T cells obtained from 
ascites fluid of a pancreatic cancer patient to recognise 
and proliferate to different synthetic peptides derived 
from mutant BAX (seq id nos 1,9-12) and mutant TGF0RII (seq 
id nos 15,17-21). The T cell line was obtained after 
expansion of T cells present in the ascites fluid of a 
patient with pancreatic adenocarcinoma. The T cell line was 
expanded in vitro by culturing with 100 U/ml recombinant 
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interleukin-2 (rIL-2) (Amersham, Aylesbury, UK) for one 
week before beeing tested in a proliferation assay. 

Autologous, irradiated (30Gy) PBMCs were seeded 5 x 104 in 
u-bottomed 96-well plates (Costar, Cambridge, MA) and 
pulsed with single synthetic peptides at 20 uM for 2h. The 
T cells were added 5 x 104 per well and the plates were 
incubated for four days at 37 »C with addition of 18.5 x 104 
Bq/mL 3H-thymidine for the last 12 hours before harvesting. 
The plates were counted in a liquid scintillation counter 
(Packard Topcount) . Data represent specific proliferation 
to the different synthetic peptides and values are 
expressed as the mean of triplicate cultures. These results 
show that T cells isolated from a pancreatic cancer patient 
are capable of responding to a panel of peptides carrying 
amino acid sequences derived from mutant BAX and TGFjiRI 



FIG. 5: 



Figure 5 further demonstrates the capability T cells from 
another pancreatic cancer patient to recognise and 
proliferate to different synthetic peptides derived from 
mutant BAX and mutant TGF P RII. The T cell line was obtained 
after expansion of T cells present in the ascites fluid of 
a patient with pancreatic adencarcinoma . The experiment was 
set up in the same way as described above. Data represent 
specific proliferation to the different synthetic peptides 
and values are expressed as the mean of triplicate 
cultures . 

in order to investigate the T cell response from the latter 
pancreatic cancer patient, responding T cells were cloned 
Peritoneal macrophages were irradiated (30 Gy) and plated 1 
x 104 into U-bottomed 96-well plates (Costar) together with 
25 uM of each peptide. T cell blasts were counted in a 
microscope and added 5 blasts per well together with 100 
U/ml human recombinant interleukin-2 (rIL-2) (Amersham, 
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Aylesbury, UK) in a total volume of 200 mL. After 14 days T 
cell clones were transferred onto 24-well plates (Costar) 
with 1 mg/mL phytohemagglutinin (PHA, Wellcome, Dartford, 
UK), 100 U/ml rIL-2 and allogeneic, irradiated PBMCs as 
5 feeder cells and screened for peptide specificity after 7 
and 14 days. 

FIG. 6: 

T cell clone 520.5, 520.7 and 520.8 were selected for 

10 further characterisation and express the cell surface 
phenotype CD3+, CD8+ and TcR+. Figure 6 shows the 
recognition and cytotoxicity of T cell clone 520.5, 520.7 
and 520.8 against peptide-pulsed autologous target cells 
pulsed with the seq id no 10 peptide. Autologous 

15 Epstein-barr virus transformed B-cells (EBV) were labelled 
with 3H- thymidine (9.25 x 104 Bq/ml) over night, washed 
once and plated 2500 cells per well in 96-well plates with 
or without 25 mM of synthetic peptide (seq id no 10) and 1% 
DMSO in medium. After 30 minutes incubation at 37 °C the 

20 plates were washed before addition of T cells. The plates 
were further incubated at 37 °C for 4 hours and then 
harvested before counting in a liquid scintillation counter 
(Packard Topcount) . Data represent percent specific lysis 
of 3H-thymidine labelled peptide pulsed target cells at an 

25 effector/target ratio of 10/1. Values are expressed as the 
mean of triplicate cultures. These results demonstrate that 
the three different T cell clones obtained from ascites 
fluid of a pancreatic carcinoma patient, exhibit specific 
cytotoxicity of autologous EBV targets pulsed with the 

30 relevant peptide (seq id no 10) derived from mutant BAX. 

FIG. 7; 

Figure 7 shows the cytolytic properties of three different 
T cell clones obtained from the same patient. These T cell 
35 clones were cultured and expanded as described above, but 
they were generated against a synthetic peptide the seq id 
no 17 peptide carrying amino acid sequences derived from 
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mutant TGFPRII. T cell clone 538.1, 538.3 and 538.4 all 
show the cell-surface phenotype CD3+, CD8 + and TcR + . The 
experimental conditions were as described above (figure 6) . 
Data represent percent specific lysis of 3H-thymidine 
labelled peptide pulsed target cells pulsed with the seq id 
no 428 peptide at an effector/target ratio of 10/1. Values 
are expressed as the mean of triplicate cultures. These 
results demonstrate that the three different T cell clones 
obtained from ascites fluid of a pancreatic carcinoma 
patient, exhibit specific cytotoxicity of autologous EBV 
targets pulsed with the relevant peptide (seq id no 428) 
derived from mutant TGFpRII. 



Synthesis 

The peptides were synthesised by using continuous flow 
solid phase peptide synthesis. N-a-Fmoc-amino acids with 
appropriate side chain protection were used. The 
Fmoc-amino acids were activated for coupling as 
pentafluorophenyl esters or by using either TBTU or 
diisopropyl carbodiimide activation prior to coupling. 
20% piperidine in DMF was used for selective removal of 
Fmoc after each coupling. Cleavage from the resin and 
final removal of side chain protection was performed by 
95% TFA containing appropriate scavengers. The peptides 
were purified and analysed by reversed phase (C18) HPLC. 
The identity of the peptides was confirmed by using 
electro-spray mass spectroscopy (Finnigan mat SSQ710) . 

The peptides used for in vitro studies of T cell 
stimulation were synthesised by this method. 

Several other well known methods can be applied by a person 
skilled in the art to synthesise the peptides. 
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Examples of the method for determining new frameshift 
mutation peptides . 

In this Example, the BAX gene is used to illustrate the 
principle. 

In each of the steps listed below, the 1st line is the gene 
sequence and 2nd line is amino acid sequence. 
In the steps 2-5, the outlined sequences represent the 
mutant part of the protein. 

Step one: 
Normal BAX. 

A TG GGG GGG GA G GCA CCC GAG CTG GCC CTG GAC CCG GTG 

M G G E A P E L A L D P V... 

Step two: 

1G deleted from gene sequence. 

ATG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC CTC 

M G G RHPSWPWTRCL 

AGG ATG CGT CCA CCA AGA AGC TGA 
R M R P P R s stop 

Step three: 

2G deleted from gene sequence. 

ATG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 
MG GGTRAGPGPGA 

TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
SGCVHQEAERVSQA 

CAT CGG GGA CGA ACT GGA CAG TAA 
H R G R T G Q stop 
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Step four: 

1G inserted in gene sequence. 

ATG GGG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 
MGGGGTRAGPGPGA 

TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
SGCVHQEAERV SQA 

CAT CGG GGA CGA ACT GGA CAG TAR. 
H R G R T G Q stop 



Step five: 

2G inserted in gene sequence. 

ATG GGG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC 
MGGGRHPSWPWTRC 

CTC AGG ATG CGT CCA CCA AGA AGC TGA 
LRMRPPRS stop 



In the next Example, the TGFpRII gene is used to illustrate 
the principle. 

In each of the steps listed below, the 1st line is the gene 
sequence and 2nd line is amino acid sequence. 
In the steps 2-5, the outlined sequences represent the 
mutant part of the protein. 



Step one: 
Normal TGFPRII. 

GAA AAA AAA AA G CCT GGT GAG ACT TTC TTC ATG TGT TCC 

E K K K P G E T F F M C S... 
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Step two: 

1A deleted from gene sequence* 

GAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT GTA 
EKKSLVRLSSCVPV 

GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG AAT 
ALMSAMTTSSSQKN 

ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 
ITPAILTCC stop 

Step three: 

2A deleted from gene sequence. 

GAA AAA AAA GCC TGG TGA 
E K K A W stop 

Step four: 

1A inserted in gene sequence. 

GAA AAA AAA AAA GCC TGG TGA 
E K K K A W stop 

Step five: 

2A inserted in gene sequence. 

GAA AAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT 
EKKKSLVRLSSCVP 

GTA GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG 
V A I* M S AM T T S S S QK 

AAT ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 
NITPAILTCC stop 

Thus the peptides of the invention may be used in a 
method for the treatment of cancers with cancer cells 
harbouring genes with frameshift mutations, which 
treatment comprises administering at least one peptide of 
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the present invention in vivo or ex vivo to a human 
patient in need of such treatment. 

In another embodiment the peptides of the invention may 
5 be used to vaccinate a human being disposed for cancers 
with cancer cells harbouring genes with frameshift 
mutations/ by administering at least one peptide of the 
present invention to said human being. 

10 It is further considered to be an advantage to administer 
to a human individual a mixture of the peptides of this 
invention, whereby each of the peptides of the invention 
can bind to different types of HLA class I and/or class 
II molecules of the individual. 

It is further anticipated that the power of an anticancer 
vaccine or peptide drug as disclosed in the above 
mentioned PCT/NO92/00032 application, can be greatly 
enhanced if the peptides of the present invention were 
20 included. Thus in another embodiment of the present 
invention peptides of the present invention are 
administered together with, either simultaneously or in 
optional sequence, with the peptides disclosed in 
PCT/NO92/00032. 

It is considered that the peptides may be administered 
together, either simultaneously or separately, with 
compounds such as cytokines and/or growth factors, i.e. 
interleukin-2 (IL-2), interleukin-12 (IL-12) , granulocyte 
30 macrophage colony stimulating factor (GM-CSF) , Flt-3 
ligand or the like in order to strengthen the immune 
response as known in the art. 

The peptides according to the present invention can be 
35 used in a vaccine or a therapeutical composition either 

alone or in combination with other materials, such as for 
instance standard adjuvants or in the form of a 
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lipopeptide conjugate which as known in the art can 
induce high-affinity cytotoxic T lymphocytes, (K. Deres, 
Nature, Vol.342, (nov.1989)). 

The peptides according to the present invention may be 
useful to include in either a peptide or recombinant 
fragment based vaccine. 

The peptides according to the present invention can be 
included in pharmaceutical compositions or in vaccines 
together with usual additives, diluents, stabilisers or 
the like as known in the art. 

According to this invention, a pharmaceutical composition 
or vaccine may include the peptides alone or in 
combination with at least one pharmaceutically acceptable 
carrier or diluent. 

Further a vaccine or therapeutical composition can 
comprise a selection of peptides which are fragments of 
the mutant proteins arising from insertion or deletion of 
bases in a repeat sequence of the gene. 

Further a vaccine composition can comprise at least one 
peptide selected for one cancer, which vaccine would be 
administered to a person carrying a genetic disposition 
for this particular cancer. 

Further a vaccine composition can comprise at least one 
peptide selected for one cancer, which vaccine would be 
administered to a person belonging to a high risk group 
for this particular cancer. 

The cancer vaccine according to this invention may 
further be administered to the population in general for 
example as a mixture of peptides giving rise to T cell 
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immunity against various common cancers connected with 
frameshift mutation genes. 

The peptides according to this invention may be 
administered as single peptides or as a mixture of 
peptides. Alternatively the peptides may be covalently 
linked with each other to form larger polypeptides or 
even cyclic polypeptides. 

A cancer therapy according to the present invention may 
be administered both in vivo or ex vivo having as the 
main goal the raising of specific T cell lines or clones 
against the mutant gene product associated with the 
cancer type with which the patient is afflicted. 

Further, the frameshift mutant peptides of this invention 
may be administered to a patient by various routes 
including but not limited to subcutaneous, intramuscular, 
intradermal, intraperitoneal, intravenous or the like. In 
one embodiment the peptides of this invention are 
administered intradermally . The peptides may be 
administered at single or multiple injection sites to a 
patient in a therapeutically or prophylactically 
effective amount. 

The peptides of this invention may be administered only 
once or alternatively several times, for instance once a 
week over a period of 1-2 months with a repeated sequence 
later all according to the need of the patient being 
treated. 

The peptides of this invention can be administered in an 
amount in the range of 1 microgram (1 ng) to 1 gram (lg) 
to an average human patient or individual to be 
vaccinated. It is preferred to use a smaller dose in the 
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rage of 1 microgram (1 ^ig) to 1 milligram (1 mg) for each 
administration . 

The invention further encompasses DNA sequences which 
encodes a frameshift mutation peptide. 

The invention additionally encompasses isolated DNA 
sequences comprising a DNA sequence encoding at least one 
frameshift mutant peptide, and administration of such 
isolated DNA sequences as a vaccine for treatment or 
prophylaxis of cancers associated with frameshift 
mutations in the genes. 

The peptides according to this invention may be 
administered to an individual in the form of DNA vaccines. 
The DNA encoding these peptides may be in the form of 
cloned plasmid DNA or synthetic oligonucleotide. The DNA 
may be delivered together with cytokines, such as IL-2, 
and/or other co-stimulatory molecules. The cytokines 
and/or co-stimulatory molecules may themselves be delivered 
in the form of plasmid or oligonucleotide DNA. The response 
to a DNA vaccine has been shown to be increased by the 
presence of immunostimulatory DNA sequences (ISS) . These 
can take the form of hexameric motifs containing methylated 
CpG, according to the formula: 

5 f -purine-purine-CG-pyrimidine-pyrimidine-3' . Our DNA 
vaccines may therefore incorporate these or other ISS, in 
the DNA encoding the peptides, in the DNA encoding the 
cytokine or other co-stimulatory molecules, or in both. A 
review of the advantages of DNA vaccination is provided by 
Tighe et al (1998, Immunology Today, 19(2), 89-97). 

In one embodiment, the DNA sequence encoding the mutant 
BAX peptides comprises: 
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Normal BAX. 

R TG GGG GGG GA G GCA CCC GAG CTG GCC CTG GAC CCG GTG 

1G deleted from BAX gene sequence. 

ATG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC CTC 
AGG ATG CGT CCA CCA AGA AGC TGA 

2G deleted from BAX gene sequence. 

ATG GGG GGA GGC ACC CGA OCT GGC CCT GGA CCC GGT GCC 
TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
CAT CGG GGA CGA ACT GGA CAG TAA 

1G inserted in BAX gene sequence. 

ATG GGG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 
TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
CAT CGG GGA CGA ACT GGA CAG TAA 

2G inserted in BAX gene sequence. 

ATG GGG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC 
CTC AGG ATG CGT CCA CCA AGA AGC TGA 

In a second embodiment, the DNA sequence encoding the 
mutant TGFpRII peptides comprises: 

Normal TGF|5RII gene. 

GAAAAA_AAA_AAG CCT GGT GAG ACT TTC TTC ATG TGT TCC. . . . 
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1A deleted from TGF0RII gene sequence. 

GAA AAA AAA AGC CTG GTG AGA CTT TCT TCA TGT GTT CCT GTA 
GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG AAT 
ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 



2A deleted from TGF0RII gene sequence. 
GAA AAA AAA GCC TGG TGA 



1A inserted in TGF0RII gene sequence. 
GAA AAA AAA AAA GCC TGG TGA 



2A inserted in TGF0RII gene sequence. 

GAA AAA AAA AAA AGC CTG GTG AGA CTT TCT TCA TGT GTT CCT 
GTA GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG 
AAT ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 



The invention further encompasses vectors and plasmids 
comprising a DNA sequence encoding a frameshift mutant 
peptide. The vectors include, but are not limited to 
E.Coli plasmid, a Listeria vector and recombinant viral 
vectors. Recombinant viral vectors include, but are not 
limited to orthopox virus, canary virus, capripox virus, 
suipox virus, vaccinia, baculovirus, human adenovirus, 
SV4 0, bovine papilloma virus and the like comprising the 
DNA sequence encoding a frameshift mutant peptide. 

It is considered that an anticancer treatment or 
prophylaxis may be achieved also through the 
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administration of an effective amount of a recombinant 
virus vector or plasmid comprising at least one insertion 
site containing a DNA sequence encoding a frameshift 
mutant peptide to a patient, whereby the patient's 
antigen presenting cells are turned into host cells for 
the vector/plasmid and presentation of HLA/ frameshift 
mutation peptide complex is achieved. 

A person skilled in the art will find other possible use 
combinations with the peptides of this invention, and 
these are meant to be encompassed by the present claim. 

The peptides according to this invention may be produced 
by conventional processes as known in the art, such as 
chemical peptide synthesis, recombinant DNA technology or 
protease cleavage of a protein or peptide encoded by a 
f rameshift mutated gene. One method for chemical 
synthesis is elucidated in the description below. 

in order for a cancer vaccine and methods for specific 
cancer therapy based on specific T cell immunity to be 
effective, three conditions must be met: 
1 The peptides used must correspond, either in their 
full length or after processing by antigen presenting 
cells, to the processed mutant protein fragment as 
presented by a HLA Class I and/or class II molecule on 
the cancer cell or other antigen presenting cells, 
2. The peptides used must be bound to a HLA Class I 
and/or Class II molecule in an immunogenic form, and 
3 T-cells capable of recognising and responding to the 
HLA/peptide complex must be present in the circulation of 
the human being. 

It has been established that all these conditions are met 
for some representative peptides according to the present 
invention. The peptides according to the present 
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invention give rise to specific T cell immune responses 
in vitro. It has been established that the peptides 
according to this invention correspond to processed 
mutant protein fragments. This is exemplified with 
peptides corresponding to fragments of transformed mutant 
BAX and TGFpRII peptides. 

Through the present invention the following advantages 
are achieved: 

- It offers a possibility to treat patients suffering 
from cancers arising from frame-shift mutations in their 
genes , most of which cancers known at present do not have 
any good treatment alternatives. 

- It offers a possibility to vaccinate prophylaxtically 
humans carrying genetic dispositions or belonging to 
other high risk groups. 

- It offers a possibility to prepare a combination 
treatment for a specific cancer, such as for instance 
colorectal or pancreatic cancers, wherein the cancer 
commonly is associated with either a frameshift mutation 
or a point mutation in the genes. 

-Since described frameshift mutations occurs in a large 
variety of cancers it will be possible to use this 
peptides in combination with established vaccines and 
future vaccines to obtain a multiple targetting 
treatment . 

-Likewise patients suffering from cancers associated with 
multiple frameshift mutations in genes can be treated 
more efficiently through a combination treatment. 
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Claims 

1 . A peptide characterised in that it 

a) is at least 8 amino acids long and is a 
fragment of a mutant protein arising from a 
frameshift mutation in a gene of a cancer cell; 

and 

b) consists of at least one amino acid of the 
mutant part of a protein sequence encoded by said 
gene ; 

and 

c) comprises 0-10 amino acids from the carboxyl 
terminus of the normal part of the protein 
sequence preceding the amino terminus of the 
mutant sequence and may further extend to the 
carboxyl terminus of the mutant part of the 
protein as determined by a new stop codon 
generated by the frameshift mutation; 

and 

d) induces, either in its full length or after 
processing by antigen presenting cell, T cell 
responses . 

2. A peptide according to claim 1 characterised in that 
contain 8-25 amino acids. 

3. A peptide according to claim 1 characterised in that 
contain 9-20 amino acids. 
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4. A peptide according to claim 1 characterised in that it 
contain 9-16 amino acids. 

5. A peptide according to claim 1 characterised in that it 
contain 8-12 amino acids. 

6. A peptide according to claim 1 characterised in that it 
contain 20-25 amino acids. 

7 • A peptide according to claim 1 characterised in that it 
contains 9 amino acids . 

8. A peptide according to claim 1 characterised in that it 
contains 12 amino acids . 

9. A peptide according to claim 1 characterised in that it 
contains 13 amino acids . 

10. A peptide according to claim 1 characterised in that it 
is a fragment of a mutant protein encoded by a frameshift 
mutation in BAX gene or TGFpRII gene. 
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11. A peptide according to claim 1 characterised in that it 
is a fragment of a mutant protein encoded by a frameshift 
mutation in hTGF02 gene, DCC gene, BRCA1 gene, BRCA2 gene, 
hPTP gene, to P 2 gene, TTK gene, CTCF gene, Human 
FADD-homologous ICE/CED-3-like protease gene, hMSH3 gene, 
hRBPl gene, hFMRl gene, Human TINUR gene, b-raf oncogene, 
NF1 gene, Human germline n-myc gene, Human n-myc gene, Human 
ras inhibitor gene, hMSH6 gene, Human nasopharynx carcinoma 
EBV BNLF-1 gene, Human cell cycle regulatory protein 
(ElA-binding protein) P 300 gene, bcl-3) gene, BIGH3, Human 
transcription factor ETV1 gene, IGFBP4 gene, Human MUC1 
gene, JAK1 gene, JAK3 gene, Human Flt4 gene, Human p53 
associated gene, hCAN gene, hDBL proto-oncogene/hMCF2PO 
gene, hDEK gene, P 107 gene, hGPRl gene, hRBP56 gene, hITF-2 
gene, hKiSS-1 gene, hTP-1 gene, hFDF-5 gene, hMTAl gene, 
hTFIIB90 gene, hLUCA-1 gene, Human Wilm's tumour (WIT-1) 
associated protein, ICErel-III gene, FasL gene,BARDl gene, 
hMCF.2 gene, fas gene and Human DPC4 gene. 

12. A peptide according to claim 1 characterised in that it 
is selected from a group of peptides having the following 
sequence identity numbers: 

seq. id. nos. 1-21 and seq id no. 428 or a fragment of any 
of these. 

13. A peptide according to claim 1 characterised in that it 
is selected from a group of peptides having the following 
sequence identity numbers: 

seq. id. nos. 22-427 and seq. id. nos. 429-437 or a 
fragment of any of these. 

14. A pharmaceutical composition comprising a peptide 
according to any of the above claims and a pharmaceutical^ 
acceptable carrier or diluent. 
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15. A cancer vaccine comprising a peptide according to any 
of the claims 1-13 and a pharmaceutically acceptable 
carrier or diluent* 

16. Use of a peptide according to any of the claims 1-13 
for the preparation of a pharmaceutical composition for 
treatment or prophylaxis of cancer. 

17. Method for vaccination of a person disposed for or 
afflicted with cancer, consisting of administering at least 
one peptide according to the claims 1-13, one or more 
times, in an amount sufficient for induction of specific 
T-cell immunity to the mutant proteins or fragments thereof 
encoded by a frameshift mutated gene. 

18. Method according to claim 17 wherein the amount of the 
peptides is in the range of 1 microgram (1 ng) to 1 gram 

(lg) and preferentially in the rage of 1 microgram (1 ng) 
to 1 milligram (1 mg) for each administration. 

19. Method for treatment of a patient afflicted with cancer 
by stimulating in vivo or ex vivo with peptides according 
to the claims 1-13. 

20. Method according to claim 19 wherein the amount of the 
peptides used is in the range of 1 microgram (1 ng) to 1 
gram (lg) and preferentially in the rage of 1 microgram (1 
lig) to 1 milligram (1 mg) for each administration. 

21. A pharmaceutical composition or vaccine composition 
comprising a combination of at least one peptide according 
to claims 1-13 and at least one peptide according to 
PCT/NO92/00032. 
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22 A method for identifying new peptides which correspond 
to fragments of proteins arising from frameshift mutations 
in genes, characterised by the following steps: 

1) identifying a gene in a cancer cell susceptible 
to frameshift mutation by having a mono nucleoside 
base repeat sequence of at least five residues, or 
a di-nucleoside base repeat sequence of at least 
four di-nucleoside base units; 

and 

2) removing, respectively, one nucleoside base 
residue or one di-nucleoside base unit from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

and/ or 

3) removing, respectively, two nucleoside base 
residues or two di-nucleoside base units from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

and/ or 

4) inserting, respectively, one nucleoside base 
residue or one di-nucleoside base unit in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 



and/ or 
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5) inserting, respectively, two nucleoside base 
residues or two di-nucleoside base units in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon. 



23. A method according to claim 22, 
charactersed it that it includes the 
following steps: 

6) determining whether the new peptides, either in 
their full length or as shorter fragments of the 
peptides, are able to stimulate T cells; 

and optionally 

7) determining peptides containing nested epitopes 
for different major HLA class I and/or HLA class II 
molecules . 

24. An isolated DNA sequence comprising a DNA sequence or 
variants thereof encoding a frameshift mutant peptide 
according to claim 1. 

25. An isolated DNA sequence encoding peptides comprising 
seq. id* nos. 1-21 and seq. id. no. 428 or variants 
thereof* 

26. An isolated DNA sequence encoding peptides comprising 
seq. id. nos. 22-427 and seq. id. nos. 429-437 or variants 
thereof. 
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27. Use of a DNA sequence according to any of the claims 
24-26 for the preparation of a pharmaceutical composition 
for treatment or prophylaxis of cancer . 

5 28. Method for treatment of a person disposed for or 

afflicted with cancer, by stimulating in vivo or ex vivo 
with DNA sequences according to the claims 24-26. 
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29. A plasmid or virus vector comprising the DNA sequence 
of claim 24 encoding a frameshift mutant peptide. 



30. A vector according to claim 29 wherein the vector is 
E.Coli plasmid, a Listeria vector and recombinant viral 
vectors. Recombinant viral vectors include, but are not 

15 limited to orthopox virus, canary virus, capripox virus, 

suipox virus, vaccinia, baculovirus, human adenovirus, SV40 
or bovine papilloma virus. 

31. Use of a plasmid or virus vector according to claim 29 
20 for the preparation of a pharmaceutical composition for 

treatment or prophylaxis of cancer. 

32. Method for treatment of a person disposed for or 
afflicted with cancer, by stimulating in vivo or ex vivo 

25 with plasmids or virus vectors according to claim 29. 



Abstract 



Peptides from oncogene protein products of frameshift 
mutated genes which eliciting T cellular immunity for use 
in cancer vaccines and compositions for anticancer 
treatment . 



Sequence identity list 



SF. pTTENCE LISTING 

rnMMON FO R ALL SEQUENCES. 
SEQUENCE TYPE: Peptide 
SEQUENCE UNIT: Amino Acid 
TOPOLOGY: Linear 



SEQUENCE ID NO: 1 

SEQUENCE LENGTH: 18 amino acids 
RHPSWPWTRCLRMRPPRS 

! 5 10 15 

SEQUENCE ID NO: 2 

SEQUENCE LENGTH: 31 amino acids 

GTRAGPGPGASGCVHQEAERVSQAHRGRTG 
! 5 10 15 20 25 30 

Q 

SEQUENCE ID NO: 3 

SEQUENCE LENGTH: 32 amino acids 

GGTRAGPGPGASGCVHQEAERVSQAHRGRT 
± 5 10 15 20 25 30 

G Q 

SEQUENCE ID NO: 4 

SEQUENCE LENGTH: 19 amino acids 
GRHPSWPWTRCLRMRPPRS 

15 10 15 



SEQUENCE ID NO: 5 

SEQUENCE LENGTH: 28 amino acids 

I Q 



DRAGRMGGRHPSWPWTRCLRMRPPRS 



1 5 10 15 20 25 
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SEQUENCE ID NO: 6 

SEQUENCE LENGTH: 19 amino acids 

IQDRAGRMGGGRHPSWPWT 
15 10 15 

SEQUENCE ID NO: 7 

SEQUENCE LENGTH: 42 amino acids 

IQDRAGRMGGGGTRAGPGPGASGCVHQEAE 
1 5 10 15 20 25 30 

RVSQAHRGRTGQ 
35 40 

SEQUENCE ID NO: 8 

SEQUENCE LENGTH: 19 amino acids 
IQDRAGRMGGGTRAGPGPG 
15 10 15 

SEQUENCE ID NO: 9 

SEQUENCE LENGTH: 22 amino acids 

IQDRAGRMGGRHPSWPWTRCLR 

15 10 15 20 

SEQUENCE ID NO: 10 

SEQUENCE LENGTH: 22 amino acids 

ASGCVHQEAERVSQAHRGRTGQ 

15 10 15 20 

SEQUENCE ID NO: 11 

SEQUENCE LENGTH: 22 amino acids 
GGTRAGPGPGASGCVHQEAERV 
15 10 15 20 

SEQUENCE ID NO: 12 

SEQUENCE LENGTH: 22 amino acids 



J 



IQDRAGRMGGGGTRAGPGPGAS 
1 5 10 15 20 

SEQUENCE ID NO: 13 

SEQUENCE LENGTH: 34 amino acids 

SLVRLSSCVPVALMSAMTTSSSQKNITPAI 
1 5 10 15 20 25 30 

L T C C 

SEQUENCE ID NO: 14 

SEQUENCE LENGTH: 44 amino acids 

SPKCIMKEKKSLVRLSSCVPVALMSAMTTS 
1 5 10 15 20 25 30 

SSQKNITPAILTCC 
35 40 

SEQUENCE ID NO: 15 

SEQUENCE LENGTH: 19 amino acids 
PKCIMKEKKKSLVRLSSCV 

1 5 10 15 

SEQUENCE ID NO: 16 

SEQUENCE LENGTH: 23 amino acids 

ALMSAMTTSSSQKNITPAILTCC 
15 10 15 20 

SEQUENCE ID NO: 17 

SEQUENCE LENGTH: 23 amino acids 

SLVRLSSCVPVALMSAMTTSSSQ 
15 10 15 20 

SEQUENCE ID NO: 18 

SEQUENCE LENGTH: 22 amino acids 
SPKCIMKEKKSLVRLSSCVPVA 

15 10 15 20 



SEQUENCE ID NO: 19 

SEQUENCE LENGTH: 12 amino acids 
SPKCIMKEKKAW 
15 10 

SEQUENCE ID NO: 20 

SEQUENCE LENGTH: 12 amino acids 
PKCIMKEKKKAW 
15 10 

SEQUENCE ID NO: 21 

SEQUENCE LENGTH: 19 amino acids 

AMTTSSSQKNITPAILTCC 
15 10 15 

SEQUENCE ID NO: 22 

SEQUENCE LENGTH: 9 amino acids 

TVGRPHISC 

1 5 

SEQUENCE ID NO: 23 

SEQUENCE LENGTH: 10 amino acids 

KTVGRPHISC 

15 10 

SEQUENCE ID NO: 24 

SEQUENCE LENGTH: 18 amino acids 
KQWEDPTSPANVIALLQT 
15 10 15 

SEQUENCE ID NO: 25 

SEQUENCE LENGTH: 17 amino acids 
QWEDPTSPANVIALLQT 
15 10 15 



SEQUENCE ID NO: 26 



SEQUENCE LENGTH: 19 amino acids 
QKT IKS TRKKTVGRPHISC 
1 5 10 15 

SEQUENCE ID NO: 27 

SEQUENCE LENGTH: 20 amino acids 

QKT IKSTRKKKTVGRPHISC 

15 10 15 20 

SEQUENCE ID NO: 28 

SEQUENCE LENGTH: 28 amino acids 

QKT IKS TRKKKQWEDPTSPANVIALLQT 
1 5 10 15 20 25 

SEQUENCE ID NO: 29 

SEQUENCE LENGTH: 27 amino acids 

QKTIKSTRKKQWEDPTSPANVIALLQT 
1 5 10 15 20 25 

SEQUENCE ID NO: 30 

SEQUENCE LENGTH: 34 amino acids 

AADLQQQFVHFLDCWDVSSIPFTLHLPQAQ 
1 5 10 15 20 25 3 

D I T T 

SEQUENCE ID NO: 31 

SEQUENCE LENGTH: 9 amino acids 

GKDAKEKSS 

1 5 

SEQUENCE ID NO: 32 

SEQUENCE LENGTH: 10 amino acids 

GKDAKEKKSS 

1 5 10 



SEQUENCE ID NO: 33 



SEQUENCE LENGTH: 42 amino acids 

GKDAKEKKAADLQQQFVHFLDCWDVSSIPF 
1 5 10 15 20 25 30 

TLHLPQAQDITT 
35 40 

SEQUENCE ID NO: 34 

SEQUENCE LENGTH: 41 amino acids 

GKDAKEKAADLQQQFVHFLDCWDVSSIPFT 
1 5 10 15 20 25 30 

LHLPQAQDITT 
35 40 



SEQUENCE ID NO: 35 

SEQUENCE LENGTH: 9 amino acids 

FSMKQTLMNVKNLKTK 

15 10 15 

SEQUENCE ID NO: 36 

SEQUENCE LENGTH: 17 amino acids 
KFSMKQTLMNVKNLKTK 
15 10 15 



SEQUENCE ID NO: 37 

SEQUENCE LENGTH: 25 amino 
VRTSKTRKKFSMK 
1 5 io 

SEQUENCE ID NO: 38 

SEQUENCE LENGTH: 26 amino 
VRTSKTRKKKFSM 
1 5 io 

SEQUENCE ID NO: 39 

SEQUENCE LENGTH: 12 amino 
VRTSKTRKKNFP 



acids 

QTLMNVKNLKTK 
15 20 25 

acids 

KQTLMNVKNLKTK 
15 20 25 

acids 



SEQUENCE ID NO: 40 

SEQUENCE LENGTH: 11 amino acids 
VRTSKTRKNFP 

1.5 10 

SEQUENCE ID NO: 41 

SEQUENCE LENGTH: 10 amino acids 

IKKKLLQFQK 

15 10 

SEQUENCE ID NO: 42 

SEQUENCE LENGTH: 11 amino acids 
KIKKKLLQFQK 
15 10 

SEQUENCE ID NO: 43 

SEQUENCE LENGTH: 17 amino acids 
KSRRNYFNFKNNCQSRL 
15 10 15 

SEQUENCE ID NO: 44 

SEQUENCE LENGTH: 16 amino acids 

SRRNYFNFKNNCQSRL 

1 5 10 15 

SEQUENCE ID NO: 45 

SEQUENCE LENGTH: 18 amino acids 
TNLRVIQKIKKKLLQFQK 
15 10 15 

SEQUENCE ID NO: 46 

SEQUENCE LENGTH: 19 amino acids 
TNLRVI QKKI KKKLLQFQK 
15 10 15 



SEQUENCE ID NO: 47 

SEQUENCE LENGTH: 25 amino acids 

TNLRVIQKKSRRNYFNFKNNCQSRL 
1 5 10 15 20 25 

SEQUENCE ID NO: 48 

SEQUENCE LENGTH: 24 amino acids 

TNLRVIQKSRRNYFNFKNNCQSRL 
15 10 15 20 

SEQUENCE ID NO: 49 

SEQUENCE LENGTH: 5 amino acids 
K I M I T 
1 5 

SEQUENCE ID NO: SO 

SEQUENCE LENGTH: 12 amino acids 

NIDKIPEKIMIT 

1 5 io 



SEQUENCE ID NO: 51 

SEQUENCE LENGTH: 13 amino acids 

NIDKIPEKKIMIT 
15 io 

SEQUENCE ID NO: 52 

SEQUENCE LENGTH: 5 amino acids 
I I N A N 
1 5 



SEQUENCE ID NO: 53 

SEQUENCE LENGTH: 6 amino acids 
K I I N A N 
1 5 



SEQUENCE ID NO: 54 

SEQUENCE LENGTH: 13 amino acids 
NDKTVSEKI INAN 
15 10 

SEQUENCE ID NO: 55 

SEQUENCE LENGTH: 14 amino acids 
NDKTVSEKKI INAN 
15 10 

SEQUENCE ID NO: 56 

SEQUENCE LENGTH: 14 amino acids 
NGLEKEYLMVNQKE 
15 10 

SEQUENCE ID NO: 57 

SEQUENCE LENGTH: 23 amino acids 

SQTSLLEAKNGLEKEYLMVNQKE 
15 10 15 20 

SEQUENCE ID NO: 58 

SEQUENCE LENGTH: 24 amino acids 

SQTSLLEAKKNGLEKEYLMVNQKE 
15 10 15 20 

SEQUENCE ID NO: 59 

SEQUENCE LENGTH: 12 amino acids 
SQTSLLEAKKMA 
15 10 

SEQUENCE ID NO: 60 

SEQUENCE LENGTH: 11 amino acids 

SQTSLLEAKMA 

1 5 10 



SEQUENCE ID NO: 61 



SEQUENCE LENGTH: 6 amino acids 
T L V F P K 
1 5 



SEQUENCE ID NO: 62 

SEQUENCE LENGTH: 7 amino acids 
K T L V F P K 
1 5 

SEQUENCE ID NO: 63 

SEQUENCE LENGTH: 14 amino acids 

LKNVEDQKTLVFPK 
15 io 

SEQUENCE ID NO: 64 

SEQUENCE LENGTH: 15 amino acids 

LKNVEDQKKTLVFPK 
1 5 io 15 

SEQUENCE ID NO: 65 

SEQUENCE LENGTH: 10 amino acids 

LKNVEDQKKH 

1 5 io 

SEQUENCE ID NO: 66 

SEQUENCE LENGTH: 9 amino acids 

LKNVEDQKH 

1 5 

SEQUENCE ID NO: 67 

SEQUENCE LENGTH: 6 amino acids 
K K I Q L Y 
1 5 

SEQUENCE ID NO: 68 

SEQUENCE LENGTH: 7 amino acids 



K K K I Q L Y 
1 5 



SEQUENCE ID NO: 69 

SEQUENCE LENGTH: 36 amino acids 
RKRFSYTEYLASI IRFI 
15 10 15 

S C N F K I 
35 

SEQUENCE ID NO: 70 

SEQUENCE LENGTH: 15 amino acids 
LRIVSYSKKKKIQLY 
1 5 10 15 

SEQUENCE ID NO: 71 

SEQUENCE LENGTH: 16 amino acids 
LRIVSYSKKKKKIQLY 
15 10 15 

SEQUENCE ID NO: 72 

SEQUENCE LENGTH: 45 amino acids 
LRIVSYSKKRKRFSYTEYLAS I IRFI FSVN 
1 5 10 15 20 25 3 

RRKEIQNLSSCNFKI 
35 40 45 



FSVNRRKEIQNLS 
20 25 30 



SEQUENCE ID NO: 73 

SEQUENCE LENGTH: 44 amino acids 

LRIVSYSKRKRFSYTEYLASIIRFIFSVNR 
1 5 10 15 20 25 

30 

RKEIQNLSSCNFKI 



SEQUENCE ID NO: 74 

SEQUENCE LENGTH: 18 amino acids 
QDLPLSSICQTIVTIYWQ 
15 10 15 

SEQUENCE ID NO: 75 

SEQUENCE LENGTH: 19 amino acids 
KQDLPLSSICQTIVTIYWQ 
15 10 15 



SEQUENCE ID NO: 76 

SEQUENCE LENGTH: 25 amino acids 

NRTCPFRLFVRRMLQFTGNKVLDRP 
15 10 15 20 25 

SEQUENCE ID NO: 77 

SEQUENCE LENGTH: 27 amino acids 

GFVVSVVKKQDLPLSSICQTIVTIYWQ 
1 5 10 15 20 25 

SEQUENCE ID NO: 78 

SEQUENCE LENGTH: 28 amino acids 

GFVVSVVKKKQDLPLSSICQTIVTIYWQ 
1 5 10 15 20 25 

SEQUENCE ID NO: 79 

SEQUENCE LENGTH: 34 amino acids 

GFVVSVVKKNRTCP FRLFVRRMLQFTGNK 

1 5 10 15 20 25 

L D R P 



SEQUENCE ID NO: 80 

SEQUENCE LENGTH: 33 amino acids 

GFVVSVVKNRTCPFRLFVRRMLQFTGN 

1 5 10 15 20 25 

D R P 



SEQUENCE ID NO: 81 

SEQUENCE LENGTH: 8 amino acids 

YRKTKNQN 

1 5 

SEQUENCE ID NO: 82 

SEQUENCE LENGTH: 9 amino acids 

KYRKTKNQN 

1 5 

SEQUENCE ID NO: 83 

SEQUENCE LENGTH: 10 amino acids 

NTERPKIRTN 

15 10 

SEQUENCE ID NO: 84 

SEQUENCE LENGTH: 17 amino acids 
DETFYKGKKYRKTKNQN 
15 10 15 

SEQUENCE ID NO: 85 

SEQUENCE LENGTH: 18 amino acids 
DET FYKGKKKYRKTKNQN 
15 10 15 

SEQUENCE ID NO: 86 

SEQUENCE LENGTH: 19 amino acids 
DETFYKGKKNTERPKIRTN 
15 10 15 

SEQUENCE ID NO: 87 

SEQUENCE LENGTH: 18 amino acids 
DET FYKGKNTERPKIRTN 
15 10 15 



SEQUENCE ID NO: 88 

SEQUENCE LENGTH: 28 amino acids 

LS INNYRFQMKFYFRFTSHGS PFTSANF 
15 10 15 20 25 

SEQUENCE ID NO: 89 

SEQUENCE LENGTH: 29 amino acids 

KLS INNYRFQMKFYFRFTSHGSPFTSANF 
15 10 15 20 25 

SEQUENCE ID NO: 90 

SEQUENCE LENGTH: 10 amino acids 

NSVSTTTGFR 

15 10 

SEQUENCE ID NO: 91 

SEQUENCE LENGTH: 37 amino acids 

NIQLAATKKLS INNYRFQMKFYFRFTSHGS 
1 5 10 15 20 25 30 

PFTSANF 
35 

SEQUENCE ID NO: 92 

SEQUENCE LENGTH: 38 amino acids 

NIQLAATKKKLS INNYRFQMKFYFRFTSHG 
1 5 10 15 20 25 30 

SPFTSANF 
35 

SEQUENCE ID NO: 93 

SEQUENCE LENGTH: 19 amino acids 
NIQLAATKKNSVSTTTGFR 
15 10 15 

SEQUENCE ID NO: 94 

SEQUENCE LENGTH: 18 amino acids 



NIQLAATKNSVSTTTGFR 
15 10 15 



SEQUENCE ID NO: 95 

SEQUENCE LENGTH: 18 amino acids 
MEHVAPGRMSASPQSPTQ 
1 5 10 15 

SEQUENCE ID NO: 96 

SEQUENCE LENGTH: 19 amino acids 
KMEHVAPGRMSASPQS PTQ 
1 5 10 15 

SEQUENCE ID NO: 97 

SEQUENCE LENGTH: 59 amino acids 

KWSTWLQAECQHLHSPQRSDKPQQAGLDQQ 

1 5 10 15 20 25 3 

HHCFALDSSPGPRPVFLQLLGLMGQGRHD 

35 40 45 50 55 

SEQUENCE ID NO: 98 

SEQUENCE LENGTH: 58 amino acids 

WSTWLQAECQHLHSPQRSDKPQQAGLDQQH 

15 10 15 20 25 3 

HCFALDSSPGPRPVFLQLLGLMGQGRHD 

35 40 45 50 55 

SEQUENCE ID NO: 99 

SEQUENCE LENGTH: 26 amino acids 

T FSVWAEKMEHVAPGRMSAS PQS PTQ 
1 5 10 15 20 25 

SEQUENCE ID NO: 100 

SEQUENCE LENGTH: 27 amino acids 

TFSVWAEKKMEHVAPGRMSASPQS PTQ 
1 5 10 15 20 25 



CQHLHSPQRSDK 
20 25 

PGPRPVFLQLLG 
50 55 



SEQUENCE ID NO: 102 

SEQUENCE LENGTH: 66 amino acids 

TFSVWAEKWSTWLQAECQHLHS PQRSDKP 
1 5 10 15 20 25 

QAGLDQQHHCFALDSSPGPRPVFLQLLGLM 
35 40 45 50 55 6 

G Q G R H D 
65 

SEQUENCE ID NO: 103 

SEQUENCE LENGTH: 18 amino acids 
HKWLKFCLLRLVKES FHE 
1 5 10 15 

SEQUENCE ID NO: 104 

SEQUENCE LENGTH: 19 amino acids 
KHKWLKFCLLRLVKES FHE 
15 10 15 

SEQUENCE ID NO: 105 

SEQUENCE LENGTH: 27 amino acids 

KGGKAKGKKHKWLKFCLLRLVKES FHE 
1 5 10 15 20 25 

SEQUENCE ID NO: 106 

SEQUENCE LENGTH: 28 amino acids 



SEQUENCE ID NO: 101 

SEQUENCE LENGTH: 67 amino acids 

TFSVWAEKKWSTWLQAE 
1 5 10 15 

QQAGLDQQHHCFALDSS 
35 40 45 

M G Q G R H D 
65 



KGGKAKGKKKHKWLKFCLLRLVKESFHE 
1 5 10 15 20 25 

SEQUENCE ID NO: 107 

SEQUENCE LENGTH: 13 amino acids 

KGGKAKGKKNTNG 

1 5 10 

SEQUENCE ID NO: 108 

SEQUENCE LENGTH: 12 amino acids 

KGGKAKGKNTNG 

1 5 10 

SEQUENCE ID NO: 109 

SEQUENCE LENGTH: 8 amino acids 

VNNFFKKL 

1 5 

SEQUENCE ID NO: 110 

SEQUENCE LENGTH: 9 amino acids 

KVNNFFKKL 

1 5 

SEQUENCE ID NO: 111 

SEQUENCE LENGTH: 16 amino acids 
LSQGNVKKVNNFFKKL 
15 10 15 

SEQUENCE ID NO: 112 

SEQUENCE LENGTH: 17 amino acids 
LSQGNVKKKVNNFFKKL 
15 10 15 

SEQUENCE ID NO: 113 

SEQUENCE LENGTH: 38 amino acids 

GEKNDLQLFVMSDRRYKIYWTVI LLNPCGN 



1 5 10 15 20 25 30 

LHLKTTSL 

35 

SEQUENCE ID NO: 114 

SEQUENCE LENGTH: 39 amino acids 

KGEKNDLQLFVMSDRRYKIYWTVILLNPCG 
1 5 10 15 20 25 30 

NLHLKTTSL 
35 

SEQUENCE ID NO: 115 

SEQUENCE LENGTH: 10 amino acids 

KGKKMICSYS 

1 5 10 

SEQUENCE ID NO: 116 

SEQUENCE LENGTH: 9 amino acids 

GKKMICSYS 

1 5 

SEQUENCE ID NO: 117 

SEQUENCE LENGTH: 46 amino acids 

SSKTFEKKGEKNDLQLFVMSDRRYKI YWTV 
1 5 10 15 20 25 30 

ILLNPCGNLHLKTTSL 
35 40 45 

SEQUENCE ID NO: 118 

SEQUENCE LENGTH: 47 amino acids 

SSKTFEKKKGEKNDLQLFVMSDRRYKIYWT 
1 5 10 15 20 25 30 

VILLNPCGNLHLKTTSL 
35 30 45 



SEQUENCE ID NO: 119 



SEQUENCE LENGTH: 18 amino acids 
SSKTFEKKKGKKMICSYS 

15 10 15 



SEQUENCE ID NO: 120 

SEQUENCE LENGTH: 17 amino acids 
SSKTFEKKGKKMICSYS 
15 10 15 



SEQUENCE ID NO: 121 

SEQUENCE LENGTH: 17 amino acids 
QRKPKRANCVI QRRAKM 
15 10 15 



SEQUENCE ID NO: 122 

SEQUENCE LENGTH: 18 amino acids 
KQRKPKRANCVIQRRAKM 

i 5 10 15 



SEQUENCE ID NO: 123 

SEQUENCE LENGTH: 26 amino acids 

NKENQKEQTALLYRGGQRCRCVCL 
15 10 15 20 



SEQUENCE ID NO: 124 

SEQUENCE LENGTH: 26 amino acids 

PDYQPPAKKQRKPKRANCVIQRRA 
15 10 15 20 



SEQUENCE ID NO: 125 

SEQUENCE LENGTH: 27 amino acids 

PDYQPPAKKKQRKPKRANCVIQRRA 
1 5 10 15 20 2 



SEQUENCE ID NO: 126 

SEQUENCE LENGTH: 35 amino acids 



PDYQPPAKKNKENQKEQTALLYRGGQRCRC 
1 5 10 15 20 25 30 

V C L R F 



35 



SEQUENCE ID NO: 127 

SEQUENCE LENGTH: 34 amino acids 

PDYQPPAKNKENQKEQTALLYRGGQRCRCV 
1 5 10 15 20 25 30 



C L R F 



SEQUENCE ID NO: 128 

SEQUENCE LENGTH: 7 amino acids 
N L S S L L I 
1 5 



SEQUENCE ID NO: 129 

SEQUENCE LENGTH: 5 amino acids 
T C L P F 
1 5 



SEQUENCE ID NO: 130 

SEQUENCE LENGTH: 15 amino acids 
QPTFTLRKNLSSLLI 
1 5 10 15 

SEQUENCE ID NO: 131 

SEQUENCE LENGTH: 16 amino acids 
QPTFTLRKKNLSSLLI 
15 10 15 

SEQUENCE ID NO: 132 

SEQUENCE LENGTH: 14 amino acids 
QPTFTLRKKTCLPF 
15 10 



SEQUENCE ID NO: 133 

SEQUENCE LENGTH: 13 amino acids 
QPTFTLRKTCLPF 
1 5 10 

SEQUENCE ID NO: 134 
SEQUENCE LENGTH: 31 amino acids 

RATFLLSLWECSLPQARLCLIVSRTGLLVQ 

1 
S 



5 10 15 20 25 30 



SEQUENCE ID NO: 135 

SEQUENCE LENGTH: 19 amino acids 
GQHFYWHCGSAACHRRGCV 

1 5 10 15 

SEQUENCE ID NO: 136 

SEQUENCE LENGTH: 39 amino acids 

KENVRDKKRATFLLSLWECSLPQARLCLIV 
1 5 10 15 20 25 30 

SRTGLLVQS 
35 

SEQUENCE ID NO: 137 

SEQUENCE LENGTH: 40 amino acids 

KENVRDKKKRATFLLSLWECSLPQARLCLI 
! 5 10 15 20 25 30 

VSRTGLLVQS 
35 40 

SEQUENCE ID NO: 138 

SEQUENCE LENGTH: 28 amino acids 

KENVRDKKKGQHFYWHCGSAACHRRGCV 
1 5 10 15 20 25 



SEQUENCE ID NO: 139 

SEQUENCE LENGTH: 27 amino acids 



KENVRDKKGQHFYWHCGSAACHRRGCV 
1 5 10 15 20 25 

SEQUENCE ID NO: 140 

SEQUENCE LENGTH: 39 amino acids 

ITHTRWGITTWDSWSVRMKANWIQAQQNKS 
15 10 15 20 25 3 

LILSPSFTK 
35 



SEQUENCE ID NO: 141 

SEQUENCE LENGTH: 40 amino acids 

KITHTRWGI TTWDSWSVRMKANWIQAQQN 
1 5 10 15 20 25 

SLILSPSFTK 
35 40 

SEQUENCE ID NO: 142 

SEQUENCE LENGTH: 16 amino acids 
KLLTPGGELPHGILGQ 
1 5 10 15 



SEQUENCE ID NO: 143 

SEQUENCE LENGTH: 15 amino acids 
LLTPGGELPHGILGQ 
15 10 15 



SEQUENCE ID NO: 144 

SEQUENCE LENGTH: 47 amino acids 

PPVCELEKITHTRWGITTWDSWSVRMKANW 
1 5 10 15 20 25 30 

IQAQQNKSLILSPS FTK 
35 40 45 



SEQUENCE ID NO: 145 

SEQUENCE LENGTH: 48 amino acids 



PPVCELEKKITHTRWGITTWDSWSVRMKAN 
1 5 10 15 20 25 3 

WIQAQQNKSLILSPSFTK 
35 40 45 

SEQUENCE ID NO: 146 

SEQUENCE LENGTH: 24 amino acids 

PPVCELEKKLLTPGGELPHGILGQ 
15 10 15 20 

SEQUENCE ID NO: 147 

SEQUENCE LENGTH: 23 amino acids 

PPVCELEKLLTPGGELPHGILGQ 
15 10 15 20 

SEQUENCE ID NO: 148 

SEQUENCE LENGTH: 11 amino acids 
SLKDELEKMKI 
15 10 

SEQUENCE ID NO: 149 

SEQUENCE LENGTH: 12 amino acids 

SLKDELEKKMKI 

1 5 10 

SEQUENCE ID NO: 150 

SEQUENCE LENGTH: 12 amino acids 
LGQSSPEKKNKN 
15 10 

SEQUENCE ID NO: 151 

SEQUENCE LENGTH: 11 amino acids 
LGQSSPEKNKN 
15 10 



SEQUENCE ID NO: 152 



SEQUENCE LENGTH: 23 amino acids 

RLRRINGRGSQIRSRNAFNRSEE 
15 10 15 20 



SEQUENCE ID NO: 153 

SEQUENCE LENGTH: 10 amino acids 

EPKVKEEKKT 

1 5 10 

SEQUENCE ID NO: 154 

SEQUENCE LENGTH: 11 amino acids 
EPKVKEEKKKT 
15 10 

SEQUENCE ID NO: 155 

SEQUENCE LENGTH: 32 amino acids 

EPKVKEEKKRLRRINGRGSQIRSRNAFNRS 
1 5 10 15 20 25 30 

E E 

SEQUENCE ID NO: 156 

SEQUENCE LENGTH: 31 amino acids 

EPKVKEEKRLRRINGRGSQIRSRNAFNRSE 
1 5 10 15 20 25 30 

E 

SEQUENCE ID NO: 157 

SEQUENCE LENGTH: 14 amino acids 
TFRYKGKQHPFFST 
15 10 

SEQUENCE ID NO: 158 

SEQUENCE LENGTH: 10 amino acids 

GPNAPEEKNH 

15 10 



SEQUENCE ID NO: 159 

SEQUENCE LENGTH: 11 amino acids 
GPNAPEEKKNH 
15 10 

SEQUENCE ID NO: 160 

SEQUENCE LENGTH: 23 amino acids 

GPNAPEEKKTFRYKGKQHPFFST 
15 10 15 20 

SEQUENCE ID NO: 161 

SEQUENCE LENGTH: 22 amino acids 
GPNAPEEKTFRYKGKQHPFFST 

15 10 15 20 

SEQUENCE ID NO: 162 

SEQUENCE LENGTH: 6 amino acids 
M Q N T C V 
1 5 

SEQUENCE ID NO: 163 

SEQUENCE LENGTH: 7 amino acids 
K M Q N T C V 
1 5 

SEQUENCE ID NO: 164 

SEQUENCE LENGTH: 9 amino acids 

KCKIRVFSK 

1 5 

SEQUENCE ID NO: 165 

SEQUENCE LENGTH: 8 amino acids 

CKIRVFSK 

1 5 



SEQUENCE ID NO: 166 



SEQUENCE LENGTH: 14 amino acids 
FFKRTVQKMQNTCV 
15 10 

SEQUENCE ID NO: 167 

SEQUENCE LENGTH: 15 amino acids 
FFKRTVQKKMQNTCV 
15 10 15 

SEQUENCE ID NO: 168 

SEQUENCE LENGTH: 17 amino acids 

FFKRTVQKKCKIRVFSK 

15 10 15 

SEQUENCE ID NO: 169 

SEQUENCE LENGTH: 16 amino acids 

FFKRTVQKCKIRVFSK 

15 10 15 

SEQUENCE ID NO: 170 

SEQUENCE LENGTH: 7 amino acids 

L P H Y L A H 

1 5 

SEQUENCE ID NO: 171 

SEQUENCE LENGTH: 8 amino acids 

CLITWLTN 

1 5 

SEQUENCE ID NO: 172 

SEQUENCE LENGTH: 17 amino acids 
GSTTGLSATPLPHYLAH 
15 10 15 

SEQUENCE ID NO: 173 

SEQUENCE LENGTH: 118 amino acids 



GSTTGLSATPPLP 
15 10 



H Y L A H 
15 



SEQUENCE ID NO: 174 

SEQUENCE LENGTH: 19 amino acids 
GSTTGLSATPPCLITWLTN 
15 10 15 

SEQUENCE ID NO: 175 

SEQUENCE LENGTH: 18 amino acids 
GSTTGLSATPCLITWLTN 
15 10 15 



SEQUENCE ID NO: 176 

SEQUENCE LENGTH: 9 amino acids 

RFADKPRPN 

1 5 



SEQUENCE ID NO: 177 

SEQUENCE LENGTH: 20 amino acids 
DLPTSPDQTRSGPVHVSVEP 
15 10 15 20 



SEQUENCE ID NO: 178 

SEQUENCE LENGTH: 19 amino acids 
DSAAGCSGTPRFADKPRPN 
15 10 15 



SEQUENCE ID NO: 179 

SEQUENCE LENGTH: 20 amino acids 
DSAAGCSGTPPRFADKPRPN 
15 10 15 20 



SEQUENCE ID NO: 180 

SEQUENCE LENGTH: 31 amino acids 

DSAAGCSGTPPDLPTSPDQTRSGPVHVSVE 



SEQUENCE ID NO: 181 

SEQUENCE LENGTH: 30 amino acids 

DSAAGCSGTPDLPTSPDQTRSGPVHVSVEP 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 182 

SEQUENCE LENGTH: 53 amino acids 

AHPE TPAQNRLRI PCS RREVRS RACKPPGA 
1 5 10 15 20 25 30 

QGSDERRGKAS PGRDCDVRTGRP 
35 40 45 50 



SEQUENCE ID NO: 183 

SEQUENCE LENGTH: 54 amino acids 

PAHPETPAQNRLRIPCSRREVRSRACKPPG 
1 5 10 15 20 25 3 

AQGSDERRGKAS PGRDC.DVRTGRP 
35 40 45 50 



SEQUENCE ID NO: 184 

SEQUENCE LENGTH: 20 amino acids 

RPTRRHPRRIASGSPAVGGR 
15 10 15 20 



SEQUENCE ID NO: 185 

SEQUENCE LENGTH: 63 amino acids 
VAIRGHPRPPAHPETP 
15 10 15 

RSRACKPPGAQGSDER 
35 40 45 

G R P 



AQNRLRI PCSRREV 
20 25 30 

RGKAS PGRDCDVRT 
50 55 60 



SEQUENCE ID NO: 186 



SEQUENCE LENGTH: 64 amino acids 

VAIRGHPRPPPAHPETPAQNRLRIPCSRRE 
15 10 15 20 25 30 

VRS RACKP PGAQGS DERRGKAS PGRDCDVR 
35 40 45 50 55 60 

T G R P 

SEQUENCE ID NO: 187 

SEQUENCE LENGTH: 30 amino acids 

VAI RGHPRPPRPTRRHPRRIASGSPAVGGR 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 188 

SEQUENCE LENGTH: 29 amino acids 

VAIRGHPRPRPTRRHPRRIASGSPAVGGR 
15 10 15 20 25 

SEQUENCE ID NO: 189 

SEQUENCE LENGTH: 85 amino acids 

RGRTSGRSLSCCRRPRCRPAVASRSTAPSP 
1 5 10 15 20 25 30 

RAGSRRCCLRTSCGAARPRRTRSACGDWVA 

35 40 45 50 55 60 

SPPTRSSSRTACGAASPPARSWSAP 

65 70 75 80 85 

SEQUENCE ID NO: 190 

SEQUENCE LENGTH: 8 amino acids 

GGGHLEEV 

1 5 

SEQUENCE ID NO: 191 

SEQUENCE LENGTH: 94 amino acids 

YFGGPDSTPRGRTSGRSLSCCRRPRCRPAV 

1 5 10 15 20 25 30 

ASRSTAPSPRAGSRRCCLRTSCGAARPRRT 



35 40 45 50 55 60 

RSACGDWVASPPTRSSSRTACGAASPPARS 
65 70 75 80 85 

W S A P 



90 



SEQUENCE ID NO: 192 

SEQUENCE LENGTH: 95 amino acids 

YFGGPDSTPPRGRTSGRSLSCCRRPRCRPA 
1 5 10 15 20 25 30 

VASRSTAPSPRAGSRRCCLRTSCGAARPRR 
35 40 45 50 55 60 

TRSACGDWVASPPTRSSSRTACGAASPPAR 
65 70 75 80 85 90 

S W S A P 

95 



SEQUENCE ID NO: 193 

SEQUENCE LENGTH: 18 amino acids 

YFGGPDSTPPGGGHLEEV 
1 5 10 15 

SEQUENCE ID NO: 194 

SEQUENCE LENGTH: 17 amino acids 

YFGGPDSTPGGGHLEEV 
15 10 15 

SEQUENCE ID NO: 195 

SEQUENCE LENGTH: 6 amino acids 
H R V A D P 
1 5 



SEQUENCE ID NO: 196 

SEQUENCE LENGTH: 13 amino acids 
LSQSSELDPPSSR 
15 10 



SEQUENCE ID NO: 197 

SEQUENCE LENGTH: 14 amino acids 

LSQSSELDPPPSSR 

15 10 

SEQUENCE ID NO: 198 

SEQUENCE LENGTH: 16 amino acids 
LSQSSELDPPHRVADP 
1 5 10 15 

SEQUENCE ID NO: 199 

SEQUENCE LENGTH: 15 amino acids 
LSQSSELDPHRVADP 
15 10 15 

SEQUENCE ID NO: 200 

SEQUENCE LENGTH: 11 amino acids 
VILLPEDTPPS 
15 10 

SEQUENCE ID NO: 201 

SEQUENCE LENGTH: 12 amino acids 
VILLPEDTPPPS 
15 10 

SEQUENCE ID NO: 202 

SEQUENCE LENGTH: 14 amino acids 
VILLPEDTPPLLRA 
15 10 

SEQUENCE ID NO: 203 

SEQUENCE LENGTH: 13 amino acids 
VILLPELDPLLRA 
15 10 



SEQUENCE ID NO: 204 



SEQUENCE LENGTH: 5 amino acids 
P S P L P 
1 5 

SEQUENCE ID NO: 205 

SEQUENCE LENGTH: 25 amino acids 

PLLFHRPCS PS PALGATVLAVYRYE 
15 10 15 20 25 

SEQUENCE ID NO: 206 

SEQUENCE LENGTH: 24 amino acids 

LLFHRPCSPSPALGATVLAVYRYE 
15 10 15 20 

SEQUENCE ID NO: 207 

SEQUENCE LENGTH: 13 amino acids 
APRPPLGPPSPLP 
15 10 

SEQUENCE ID NO: 208 

SEQUENCE LENGTH: 14 amino acids 
APRPPLGPPPSPLP 
1 5 10 

SEQUENCE ID NO: 209 

SEQUENCE LENGTH: 34 amino acids 

APRPPLGPPPLLFHRPCSPSPALGATVLAV 
1 5 10 15 20 25 3 

Y R Y E 

SEQUENCE ID NO: 210 

SEQUENCE LENGTH: 33 amino acids 

APRPPLGPPLLFHRPCSPS PALGATVLAVY 
1 5 10 15 20 25 3( 

RYE 



SEQUENCE ID NO: 211 

SEQUENCE LENGTH: 28 amino acids 

TQVLPQGCSLSLLHTTFPHRQVPHILDW 
1 5 10 15 20 25 

SEQUENCE ID NO: 212 

SEQUENCE LENGTH: 29 amino acids 

PTQVLPQGCSLSLLHTTFPHRQVPHILDW 
1 5 10 15 20 25 

SEQUENCE ID NO: 213 

SEQUENCE LENGTH: 54 amino acids 

PLQS FPKDAASAFSTPRFPTDKFPTSWTGS 
! 5 10 15 20 25 30 

CPGQPHGTRAFCQPGPEFNAFSAC 
35 40 45 50 

SEQUENCE ID NO: 214 

SEQUENCE LENGTH: 53 amino acids 

LQSFPKDAASAFSTPRFPTDKFPTSWTGSC 
1 5 10 15 20 25 30 

PGQPHGTRAFCQPGPEFNAFSAC 
35 40 45 50 

SEQUENCE ID NO: 215 

SEQUENCE LENGTH: 38 amino acids 

PSPRPQSQPPTQVLPQGCSLSLLHTTFPHR 
1 5 10 15 20 25 30 

QVPHILDW 
35 

SEQUENCE ID NO: 216 

SEQUENCE LENGTH: 39 amino acids 

PSPRPQSQPPPTQVLPQGCSLSLLHTTFPH 
1 5 10 15 20 25 30 



RQVPHILDW 
35 



SEQUENCE ID NO: 217 

SEQUENCE LENGTH: 64 amino acids 

PSPRPQSQPPPLQS FPKDAASAFSTPRFPT 
1 5 10 15 20 25 30 

DKFPTSWTGSCPGQPHGTRAFCQPGPEFNA 
35 40 45 50 55 60 

F S A C 



SEQUENCE ID NO: 218 

SEQUENCE LENGTH: 63 amino acids 

PSPRPQSQPPLQSFPKDAASAFSTPRFPTD 
1 5 10 15 20 25 30 

KFPTSWTGSCPGQPHGTRAFCQPGPEFNAF 
35 40 45 50 55 60 

SAC 

SEQUENCE ID NO: 219 

SEQUENCE LENGTH: 30 amino acids 

TAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 220 

SEQUENCE LENGTH: 31 amino acids 

PTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 221 

SEQUENCE LENGTH: 50 amino acids 

PRPGPAGGALLPRSLTAFVPHSGHGLPVSS 
1 5 10 15 20 25 30 

GEPAYTPI PHDVPHGTPPFC 
35 40 45 50 



SEQUENCE ID NO: 222 

SEQUENCE LENGTH: 49 amino acids 
RPGPAGGALLPRSLTA 
15 10 15 

EPAYTPIPHDVPHGTP 
35 40 45 

SEQUENCE ID NO: 223 

SEQUENCE LENGTH: 39 amino acids 
DLPAVPGPPTAWPGRR 
1 5 10 15 

GPWAPRFLW 
35 

SEQUENCE ID NO: 224 

SEQUENCE LENGTH: 40 amino 
DLPAVPGPPPTAW 
15 10 
LGPWAPRFLW 
35 40 

SEQUENCE ID NO: 225 

SEQUENCE LENGTH: 59 amino acids 
DLPAVPGPPPRPGPAG 
15 10 15 

SGHGLPVSSGEPAYTP 
35 40 45 



FVPHSGHGLPVSSG 
20 25 30 

P F C 



RFTTPEPYCLCTPL 
20 25 30 



GALLPRS LTAFVPH 
20 25 30 

IPHDVPHGTPPFC 
50 55 



acids 

PGRRRFTTPEPYCLCTP 
15 20 25 30 



SEQUENCE ID NO: 226 

SEQUENCE LENGTH: 58 amino acids 

DLPAVPGPPRPGPAGGALLPRSLTAFVPHS 
1 5 10 15 20 25 30 

GHGLPVSSGEPAYTPI PHDVPHGTPPFC 
35 40 45 50 55 



SEQUENCE ID NO: 227 

SEQUENCE LENGTH: 8 amino acids 

QWGLSWMS 

1 5 

SEQUENCE ID NO: 228 

SEQUENCE LENGTH: 14 amino acids 
NGDCHGCPEGRQSL 
15 10 

SEQUENCE ID NO: 229 

SEQUENCE LENGTH: 17 amino acids 

FTMDRVLTPQWGLSWMS 
15 10 15 

SEQUENCE ID NO: 230 

SEQUENCE LENGTH: 18 amino acids 
FTMDRVLTPPQWGLSWMS 
15 10 15 

SEQUENCE ID NO: 231 

SEQUENCE LENGTH: 24 amino acids 

FTMDRVLTPPNGDCHGCPEGRQSL 
15 10 15 20 

SEQUENCE ID NO: 232 

SEQUENCE LENGTH: 23 amino acids 

FTMDRVLTPNGDCHGCPEGRQSL 
15 10 15 20 

SEQUENCE ID NO: 233 

SEQUENCE LENGTH: 115 amino acids 

HHPARQCPHCIMHLQTQLIHRNLTGPSQLT 
1 5 10 15 20 25 30 

SLHRSPYQIAATPWTTDFAAS FFLNPVTPF 
35 40 45 50 55 60 



LLCRRCQGKDVLCTN A R CLSQTSPSHHKAL 
65 70 75 80 85 90 

SRTTTQCMNTTPWLAVRPAKAFPLL 

95 100 105 110 115 

SEQUENCE ID NO: 234 

SEQUENCE LENGTH: 116 amino acids 

PHHPARQCPHCIMHLQTQLIHRNLTGPSQL 
1 5 10 15 20 25 30 

TSLHRSPYQIAATPWTTDFAASFFLNPVTP 

35 40 45 50 55 60 

FLLCRRCQGKDVLCTNARCLSQTSPSHHKA 

65 70 75 80 85 90 

LSRTTTQCMNTTPWLAVRPAKAFPLL 

95 100 105 110 115 



SEQUENCE ID NO: 235 

SEQUENCE LENGTH: 23 amino acids 

HTIQHASVPTASCISKLNSYTEN 
1 5 10 15 20 

SEQUENCE ID NO: 236 

SEQUENCE LENGTH: 126 amino acids 

PQVGMRPSNPPHHPARQCPHCIMHLQTQLI 
1 5 10 15 20 25 30 

HRNLTGPSQLTSLHRSPYQIAATPWTTDFA 

35 40 45 50 55 60 

AS FFLNPVTPFLLCRRCQGKDVLCTNARCL 

65 70 75 80 85 90 

SQTSPSHHKALSRTTTQCMNTTPWLAVRPA 

95 100 105 110 115 120 

K A F P L L 

125 



SEQUENCE ID NO: 237 

SEQUENCE LENGTH: 127 amino acids 



PQVGMRPSNPPPHHPARQCPHC IMHLQTQL 
1 5 10 15 20 25 30 

IHRNLTGPSQLTSLHRSPYQIAATPWTTDF 
35 40 45 50 55 60 

AASFFLNPVTPFLLCRRCQGKDVLCTNARC 

65 70 75 80 85 90 

LSQTSPSHHKALSRTTTQCMNTTPWLAVRP 

95 100 105 no US 120 

A K A F P L L 

125 



SEQUENCE ID NO: 238 

SEQUENCE LENGTH: 34 amino acids 

PQVGMRPSNPPHTIQHASVPTASCISKLNS 

30 



1 5 10 15 20 25 

Y T E N 



SEQUENCE ID NO: 239 

SEQUENCE LENGTH: 33 amino acids 

PQVGMRPSNPHTIQHASVPTASCISKLNSY 

30 



1 5 10 15 20 25 



TEN 



SEQUENCE ID NO: 240 

SEQUENCE LENGTH: 51 amino acids 

WAARSWCERRAAAVAPLAPWAWGCPAGCTP 
1 5 10 15 20 25 30 

PVAARACAATRPEGWRS PCTH 
35 40 45 50 

SEQUENCE ID NO: 241 

SEQUENCE LENGTH: 52 amino acids 

PWAARSWCERRAAAVAPLAPWAWGCPAGCT 
1 5 10 15 20 25 30 



PPVAARACAATRPEGWRSPCTH 
35 40 45 50 



SEQUENCE ID NO: 242 

SEQUENCE LENGTH: 74 amino acids 

RGLRGAGARGGLRLLRHLRPGLGDALRGVH 

1 5 10 15 20 25 30 

PPLRLGPALLPAPRGGEAPAHTDARARRVH 

35 40 45 50 55 60 

GAGGDRGHPGPAAL 

65 70 

SEQUENCE ID NO: 243 

SEQUENCE LENGTH: 61 amino acids 

EEKLARCRPPWAARSWCERRAAAVAPLAPW 

1 5 10 15 20 25 30 

AWGCPAGCTPPVAARACAATRPEGWRSPCTH 

35 40 45 50 55 60 

SEQUENCE ID NO: 244 

SEQUENCE LENGTH: 62 amino acids 

EEKLARCRPPPWAARSWCERRAAAVAPLAP 

1 5 10 15 20 25 30 

WAWGCPAGCTPPVAARACAATRPEGWRS PCT 
35 40 45 50 55 60 

SEQUENCE ID NO: 245 

SEQUENCE LENGTH: 84 amino acids 

EEKLARCRPPRGLRGAGARGGLRLLRHLRP 

1 5 10 15 20 25 30 

GLGDALRGVHPPLRLGPALLPAPRGGEAPA 

35 40 45 50 55 60 

HTDARARRVHGAGGDRGHPGPAAL 

65 70 75 80 



SEQUENCE ID NO: 246 

SEQUENCE LENGTH: 83 amino acids 

EEKLARCRPRGLRGAGARGGLRLLRHLRPG 
1 5 10 15 20 25 30 

LGDALRGVHPPLRLGPALLPAPRGGEAPAH 
35 40 45 50 55 60 

TDARARRVHGAGGDRGHPGPAAL 
65 70 75 80 

SEQUENCE ID NO: 247 

SEQUENCE LENGTH: 163 amino acids 

QPPVSPRPRRPGRPRAPPPPQPMVSPRRRT 

1 5 10 15 20 25 30 

TGPPWRPPPLQSTMSPPPQALHQAQLLLWC 
35 40 45 50 55 60 

TTAPLPGLPQPQPARALHSQFPATTLILLP 

65 70 75 80 85 90 

PLPAIAPRLMPVALT IARYLLS PPP I TALL 

95 100 105 110 115 120 

PSCLLGSLSFSCLFTFQTSSLIPLWKIPAP 

125 130 135 140 145 150 

TTTKSCRETFLKW 
155 160 

SEQUENCE ID NO: 248 

SEQUENCE LENGTH: 85 amino acids 

SPGCHLGPGDQAAPGLHRPPS PWCHLGAGQ 
1 5 10 15 20 25 30 

QARLGVHRPSSPQCHLGLRLCIRLSFYSGA 

35 40 4 5 50 55 60 

QRHLCQGYHNPSQQEHSILNSQPPL 

65 70 75 80 85 

SEQUENCE ID NO: 249 

SEQUENCE LENGTH: 172 amino acids 

KPAPGSTAPQ P PVSPRPR RPGR p RAppppQ 



1 5 10 15 20 25 30 

PMVSPRRRTTGPPWRPPPLQSTMSPPPQAL 

35 40 45 50 55 60 

HQAQLLLWCTTAPLPGLPQPQPARALHSQF 

65 70 75 80 85 90 

PATTLILLPPLPAIAPRLMPVALTIARYLL 

95 100 105 110 115 120 

SPPPITALLPSCLLGSLSFSCLFTFQTSSL 

125 130 135 140 145 150 

IPLWKIPAPTTTKSCRETFLKW 
155 160 165 170 



SEQUENCE ID NO: 250 

SEQUENCE LENGTH: 173 amino acids 

KPAPGSTAPPQPPVSPRPRRPGRPRAPPPP 

1 5 10 15 20 25 30 

QPMVSPRRRTTGPPWRPPPLQSTMSPPPQA 

35 40 45 50 55 60 

LHQAQLLLWCTTAPLPGLPQPQPARALHSQ 

65 70 75 80 85 90 

FPATTLILLPPLPAIAPRLWPVALTIARYL 

95 100 105 110 115 120 

LSPPPITALLPSCLLGSLSFSCLFTFQTSS 

125 130 135 140 145 150 

LIPLWKIPAPTTTKSCRETFLKW 
155 160 165 170 

SEQUENCE ID NO: 251 

SEQUENCE LENGTH: 65 amino acids 

KPAPGSTAPPSPGCHLGPGDQAAPGLHRPP 

1 5 10 15 20 25 30 

SPWCHLGAGQQARLGVHRPSSPQCHLGLRL 



C IRLS FYSGAQRHLCQGYHNPSQQEHS I 

35 40 45 50 55 

S Q P P L 



L N 

60 



65 



SEQUENCE ID NO: 252 

SEQUENCE LENGTH: 94 amino acids 

KPAPGSTAPSPGCHLGPGDQAAPGLHRPPS 
1 5 10 15 20 25 30 

PWCHLGAGQQARLGVHRPSS PQCHLGLRLC 
35 40 45 50 55 60 

IRLSFYSGAQRHLCQGYHNPSQQEHSILNS 
65 70 75 80 85 90 

Q P P L 



SEQUENCE ID NO: 253 

SEQUENCE LENGTH: 113 amino acids 

QPMVSPRRRTTGPPWRPPPLQSTMSPPPQA 
1 5 10 15 20 25 30 

LHQAQLLLWCTTAPLPGLPQPQPARALHSQ 

35 40 45 50 55 60 

FPATTLILLPPLPAIAPRLMPVALT IARYL 

65 70 75 80 85 90 

LSPPPITALLPSCLLGSLSFSCLFTFQTSS 

LIPLWKI PAPTTTKSCRETFLKW 
95 100 105 HO 



SEQUENCE ID NO: 254 

SEQUENCE LENGTH: 65 amino acids 

S PWCHLGAGQQARLGVHRPS S PQCHLGLRL 
1 5 10 15 20 25 30 

CIRLSFYSGAQRHLCQGYHNPSQQEHSILN 

35 40 45 50 55 60 

S Q P P L 

65 



SEQUENCE ID NO: 255 

SEQUENCE LENGTH: 18 amino acids 
RPPPGSTAPQPMVSPRRR 
15 10 15 

SEQUENCE ID NO: 256 

SEQUENCE LENGTH: 19 amino acids 
RPPPGSTAPPQPMVSPRRR 
15 10 15 

SEQUENCE ID NO: 257 

SEQUENCE LENGTH: 18 amino acids 
RPPPGSTAPPSPWCHLGA 
15 10 15 

SEQUENCE ID NO: 258 

SEQUENCE LENGTH: 17 amino acids 
RPPPGSTAPSPWCHLGA 
15 10 15 

SEQUENCE ID NO: 259 

SEQUENCE LENGTH: 14 amino acids 
RPRAPPPPSPWCHL 
1 5 10 

SEQUENCE ID NO: 260 

SEQUENCE LENGTH: 13 amino acids 
RPRAPPPPPSPWC 
15 10 

SEQUENCE ID NO: 261 

SEQUENCE LENGTH: 16 amino acids 
RPRAPPPPAHGVTSAP 
1 5 10 15 



SEQUENCE ID NO: 262 



* v ' . 



SEQUENCE LENGTH: 13 amino acids 
RPRAPPPPPAHGV 
15 10 

SEQUENCE ID NO: 263 

SEQUENCE LENGTH: 14 amino acids 

APGLHRPPQPMVSP 

15 10 

SEQUENCE ID NO: 264 

SEQUENCE LENGTH: 15 amino acids 
AAPGLHRPQPMVS PR 
15 10 15 

SEQUENCE ID NO: 265 

SEQUENCE LENGTH: 13 amino acids 
PGLHRPPPAHGVT 
15 10 

SEQUENCE ID NO: 266 

SEQUENCE LENGTH: 14 amino acids 
APGLHRPPAHGVTS 
1 5 10 

SEQUENCE ID NO: 267 

SEQUENCE LENGTH: 21 amino acids 
VDRPQHTEWLSWSNLYRIRHQ 
15 10 15 20 

SEQUENCE ID NO: 268 

SEQUENCE LENGTH: 10 amino acids 

HYLCTDVAPR 

15 10 

SEQUENCE ID NO: 269 

SEQUENCE LENGTH: 11 amino acids 



HYLCTDVAPPR 
15 10 

SEQUENCE ID NO: 270 

SEQUENCE LENGTH: 31 amino acids 

HYLCTDVAPPVDRPQHTEWLSWSNLYRIRH 
1 5 10 15 20 25 30 

Q 

SEQUENCE ID NO: 271 

SEQUENCE LENGTH: 30 amino acids 

HYLCTDVAPVDRPQHTEWLSWSNLYRIRHQ 
! 5 10 15 20 25 30 

SEQUENCE ID NO: 272 

SEQUENCE LENGTH: 108 amino acids 

SAYLSPLGTTWLRTCACRLPRPAASCLCTT 

1 5 10 15 20 25 30 

PSLLWPRRTCPAGSPRATSSPWRMPAPKSC 

35 40 45 50 55 60 

CTTGLAFTSPIGLGWRSATASGYARIWPVL 

65 70 75 80 85 90 

SLTCQSWSTSLPSTAVTW 
95 100 105 

SEQUENCE ID NO: 273 

SEQUENCE LENGTH: 109 amino acids 

PSAYLSPLGTTWLRTCACRLPRPAASCLCT 
1 5 10 15 20 25 30 

TPSLLWPRRTCPAGSPRATSS PWRMPAPKS 

35 40 45 50 55 60 

CCTTGLAFTSPIGLGWRSATASGYARIWPV 

65 70 75 80 85 90 

LSLTCQSWSTSLPSTAVTW 
95 100 105 



SEQUENCE ID NO: 274 

SEQUENCE LENGTH: 12 amino acids 
PAPI FLLWGPLG 
15 10 

SEQUENCE ID NO: 275 

SEQUENCE LENGTH: 11 amino acids 
API FLLWGPLG 
15 10 

SEQUENCE ID NO: 276 

SEQUENCE LENGTH: 117 amino acids 

LP ARAPGPPSAYLSPLGTTWLRTCACRLPR 
1 5 10 15 20 25 30 

PAASCLCTTPSLLWPRRTCPAGSPRATSSP 

35 40 45 50 55 60 

WRMPAPKSCCTTGLAFTSPIGLGWRSATAS 

65 70 75 80 85 90 

GYARIWPVLSLTCQSWSTSLPSTAVTW 

95 100 105 110 115 

SEQUENCE ID NO: 277 

SEQUENCE LENGTH: 118 amino acids 

LPARAPGPPPSAYLSPLGTTWLRTCACRLP 
1 5 10 15 20 25 30 

RPAASCLCTTPSLLWPRRTCPAGSPRATSS 

35 40 45 50 55 60 

PWRMPAPKSCCTTGLAFTSPIGLGWRSATA 

65 70 75 80 85 90 

SGYARIWPVLSLTCQSWSTSLPSTAVTW 

95 100 105 110 115 

SEQUENCE ID NO: 278 

SEQUENCE LENGTH: 21 amino acids 
LPARAPGPPPAPI FLLWGPLG 
15 10 15 20 



SEQUENCE ID NO: 279 

SEQUENCE LENGTH: 20 amino acids 
LPARAPGPPAP I FLLWGPLG 
15 10 15 20 

SEQUENCE ID NO: 280 

SEQUENCE LENGTH: 14 amino acids 
DLEHHGGVTRHRHR 
15 10 



SEQUENCE ID NO: 281 

SEQUENCE LENGTH: 11 amino acids 
L V SDYSMTPRP 
15 10 



SEQUENCE ID NO: 282 

SEQUENCE LENGTH: 12 amino acids 
LVSDYSMTPPRP 
15 10 

SEQUENCE ID NO: 283 

SEQUENCE LENGTH: 24 amino acids 

LVSDYSMTPPDLEHHGGVTRHRHR 
15 10 15 20 

SEQUENCE ID NO: 284 

SEQUENCE LENGTH: 23 amino acids 

LVSDYSMTPDLEHHGGVTRHRHR 
15 10 15 20 



SEQUENCE ID NO: 285 

SEQUENCE LENGTH: 51 amino acids 

FHHIATDVGPFVRIGFLKIKGKIKGKSLRK 
1 5 10 15 20. 25 30 

PNWKTQHKLKRALMFLIVKKL 



35 40 45 50 

SEQUENCE ID NO: 286 

SEQUENCE LENGTH: 52 amino acids 
seq id no 286; 

PFHHIATDVGPFVRIGFLKIKGKIKGKSLR 
1 5 10 15 20 25 30 

KPNWKTQHKLKRALMFLIVKKL 
35 40 45 50 

SEQUENCE ID NO: 287 

SEQUENCE LENGTH: 12 amino acids 
PSITLQQMLAPS 
15 10 

SEQUENCE ID NO: 288 

SEQUENCE LENGTH: 11 amino acids 
SITLQQMLAPS 
15 10 

SEQUENCE ID NO: 289 

SEQUENCE LENGTH: 60 amino acids 

TSCNEMNPPFHH IATDVGP FVRI G FLKI KG 
1 5 10 15 20 25 30 

KIKGKSLRKPNWKTQHKLKRALMFL IVKKL 
35 40 45 50 55 60 

SEQUENCE ID NO: 290 

SEQUENCE LENGTH: 61 amino acids 

TSCNEMNPPPFHHIATDVGPFVRIGFLKIK 
1 5 10 15 20 25 30 

GKIKGKSLRKPNWKTQHKLKRALMFLIVKK 
35 40 45 50 55 60 



SEQUENCE ID NO: 291 



SEQUENCE LENGTH: 20 amino acids 
TSCNEMNPPSITLQQMLAPS 
15 10 15 20 

SEQUENCE ID NO: 292 

SEQUENCE LENGTH: 21 amino acids 
TSCNEMNPPPSITLQQMLAPS 
15 10 15 20 

SEQUENCE ID NO: 293 

SEQUENCE LENGTH: 10 amino acids 

LEMILFLMTF 

15 10 

SEQUENCE ID NO: 294 

SEQUENCE LENGTH: 18 amino acids 
HPCITKTFLEMILFLMTF 
15 10 15 

SEQUENCE ID NO: 295 

SEQUENCE LENGTH: 19 amino acids 
HPCITKTFFLEMILFLMTF 
15 10 15 

SEQUENCE ID NO: 296 

SEQUENCE LENGTH: 11 amino acids 
HPCITKTFFWR 
15 10 

SEQUENCE ID NO: 297 

SEQUENCE LENGTH: 10 amino acids 

HPCITKTFWR 

15 10 



SEQUENCE ID NO: 298 

SEQUENCE LENGTH: 22 amino acids 



LMFEHSQMRLNSKNAHLPIISF 
15 10 15 20 



SEQUENCE ID NO: 299 

SEQUENCE LENGTH: 30 amino acids 

EYGS I IAFLMFEHSQMRLNSKNAHLPI IS F 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 300 

SEQUENCE LENGTH: 31 amino acids 

EYGS I IAFFLMFEHSQMRLNSKNAHLP I I S 
1 5 10 15 20 25 30 

F 

SEQUENCE ID NO: 301 

SEQUENCE LENGTH: 15 amino acids 
HLNKGRRLGDKIRAT 
15 10 15 

SEQUENCE ID NO: 302 

SEQUENCE LENGTH: 16 amino acids 
FHLNKGRRLGDKIRAT 
15 10 15 

SEQUENCE ID NO: 303 

SEQUENCE LENGTH: 23 amino acids 

VTSGTPFFHLNKGRRLGDKIRAT 
15 10 15 20 

SEQUENCE ID NO: 304 

SEQUENCE LENGTH: 24 amino acids 

VTSGTPFFFHLNKGRRLGDKIRAT 
15 10 15 20 

SEQUENCE ID NO: 305 

SEQUENCE LENGTH: 10 amino acids 



VTSGTPFFFI 
15 10 



SEQUENCE ID NO: 306 

SEQUENCE LENGTH: 9 amino acids 

VTSGTPFFI 

1 5 

SEQUENCE ID NO: 307 

SEQUENCE LENGTH: 10 amino acids 

CEIERIHFFF 

15 10 

SEQUENCE ID NO: 308 

SEQUENCE LENGTH: 11 amino acids 
CEIERIHFFSK 
15 10 

SEQUENCE ID NO: 309 

SEQUENCE LENGTH: 10 amino acids 

CEIERIHFSK 

15 10 

SEQUENCE ID NO: 310 

SEQUENCE LENGTH: 8 amino acids 

FRYISKSI 

1 5 

SEQUENCE ID NO: 311 

SEQUENCE LENGTH: 7 amino acids 
R Y I S K S I 
1 5 

SEQUENCE ID NO: 312 

SEQUENCE LENGTH: 16 amino acids 

FKKYEPIF F.R Y I S K S I 



1 5 



10 



SEQUENCE ID NO: 313 

SEQUENCE LENGTH: 15 amino acids 
FKKYEPI FRYISKS I 
15 10 15 

SEQUENCE ID NO: 314 

SEQUENCE LENGTH: 56 amino acids 

FPDSDQPGPLYPLDPSCLISSASNPQE 
1 5 10 15 20 25 

CHYIHLAFGFSNWRSCPVLPGHCGVQ 
35 40 45 50 55 

SEQUENCE ID NO: 315 

SEQUENCE LENGTH: 55 amino acids 

PDSDQPGPLYPLDPSCLISSASNPQEL 
1 5 10 15 20 25 

HYIHLAFGFSNWRSCPVLPGHCGVQ 
35 40 45 50 55 

SEQUENCE ID NO: 316 

SEQUENCE LENGTH: 9 amino acids 
LNMFASVFS 
1 5 

SEQUENCE ID NO: 317 

SEQUENCE LENGTH: 10 amino acids 
LNMFASVFFS 
1 5 10 15 

SEQUENCE ID NO: 318 

SEQUENCE LENGTH: 64 amino acids 

LNMFASVFFPDSDQPGPLYPLDPSCLISSA 
1 5 10 15 20 25 30 

SNPQELSDCHYIHLAFGFSNWRSCPVLPGH 



LSD 
30 



S D C 
30 



35 40 45 50 55 60 

C G V Q 

SEQUENCE ID NO: 319 

SEQUENCE LENGTH: 63 amino acids 

LNMFASVFPDSDQPGPLYPLDPSCLISSAS 

1 5 10 15 20 25 30 

NPQELSDCHYIHLAFGFSNWRSCPVLPGHC 

35 40 45 50 55 60 

G V Q 

SEQUENCE ID NO: 320 

SEQUENCE LENGTH: 63 amino acids 

AMEETVVVAVATVETEVEAMEETGVVAAME 

1 5 10 15 20 25 30 

ETEVGATEETEVAMEAKWEEETTTEMISAT 

35 40 45 50 55 60 

D H T 



SEQUENCE ID NO: 321 

SEQUENCE LENGTH: 55 amino acids 

LWVRPWLWEWLRWRPKWRLWRRQEWWRLWR 

1 5 10 15 20 25 30 

RPRWGLRRRPRWLWRENGRKKRLQK 
35 40 45 50 55 



SEQUENCE ID NO: 322 

SEQUENCE LENGTH: 71 amino acids 

YGGDRSRGAMEETVVVAVATVETEVEAMEE 

1 5 10 15 20 25 30 

TGVVAAMEETEVGATEETEVAMEAKWEEET 

35 40 45 50 55 60 

TTEMISATDHT 

65 70 



SEQUENCE ID NO: 323 

SEQUENCE LENGTH: 72 amino acids 
YGGDRSRGGAMEETVV 
15 10 15 

ETGVVAAMEETEVGAT 

35 40 45 

TTTEMISATDHT 

65 70 



VAVATVETEVEAME 
20 25 30 

EETEVAMEAKWEEE 
50 55 60 



SEQUENCE ID NO: 324 

SEQUENCE LENGTH: 64 amino acids 
YGGDRSRGGLWVRPWL 
15 10 15 

RQEWWRLWRRPRWGLR 
35 40 45 

R L Q K 



WEWLRWEPKWRLWR 
20 25 30 

RRPRWLWRENGRKK 
50 55 60 



SEQUENCE ID NO: 325 

SEQUENCE LENGTH: 63 amino acids 

YGGDRSRGLWVRPWL WE WLRWEPKWRLWRR 
1 5 10 15 20 25 30 

QEWWRLWRRPRWGLRRRPRWLWRENGRKKR 
35 40 45 50 55 60 

L Q K 

SEQUENCE ID NO: 326 

SEQUENCE LENGTH: 9 amino acids 

EFGGGRRQK 

1 5 



SEQUENCE ID NO: 327 

SEQUENCE LENGTH: 8 amino acids 

EFGGRRQK 

1 5 



SEQUENCE ID NO: 328 

SEQUENCE LENGTH: 15 amino acids 
RRAKGGGAGASNPRQ 
15 10 15 

SEQUENCE ID NO: 329 

SEQUENCE LENGTH: 16 amino acids 
GRRAKGGGAGASNPRQ 
15 10 15 

SEQUENCE ID NO: 330 

SEQUENCE LENGTH: 21 amino acids 
DVGLREGALELPTRGNKRNVA 

15 10 15 20 

SEQUENCE ID NO: 331 

SEQUENCE LENGTH: 24 amino acids 

MRGGGGVGGRRAKGGGAGASNPRQ 
15 10 15 20 

SEQUENCE ID NO: 332 

SEQUENCE LENGTH: 25 amino acids 

MRGGGGVGGGRRAKGGGAGASNPRQ 
1 5 10 15 20 25 

SEQUENCE ID NO: 333 

SEQUENCE LENGTH: 30 amino acids 

MRGGGGVGGDVGLREGALELPTRGNKRNVA 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 334 

SEQUENCE LENGTH: 29 amino acids 

MRGGGGVGDVGLREGALELPTRGNKRNVA 
1 5 10 15 20 25 



SEQUENCE ID NO: 335 



SEQUENCE LENGTH: 25 amino acids 
VWQLAGPMLAGWRSLGS 
15 10 15 



WFCRMYGI 
20 25 



SEQUENCE ID NO: 336 

SEQUENCE LENGTH: 4 6 amino acids 

CGSWPALCWRAGGVWAVGSAGCMEYDPEAL 
1 5 10 15 20 25 30 

PAAWGPAAAATVHPRR 
35 40 45 

SEQUENCE ID NO: 337 

SEQUENCE LENGTH: 33 amino acids 

RRYPCEWGVWQLAGPMLAGWRSLGSWFCRM 

1 5 10 15 20 25 30 

Y G I 



SEQUENCE ID NO: 338 

SEQUENCE LENGTH: 34 amino acids 

RRYPCEWGGVWQLAGPMLAGWRSLGSWFCR 
1 5 10 15 20 25 30 

M Y G I 

SEQUENCE ID NO: 339 

SEQUENCE LENGTH: 55 amino acids 

RRYPCEWGGCGSWPALCWRAGGVWAVGSAG 
1 5 10 15 20 25 30 

CMEYDPEALPAAWGPAAAATVHPRR 
35 40 45 50 55 

SEQUENCE ID NO: 340 

SEQUENCE LENGTH: 54 amino acids 

RRYPCEWGCGSWPALCWRAGGVWAVGSAGC 
1 5 10 15 20 25 30 

MEYDPEALPAAWGPAAAATVHPRR 



35 40 45 50 

SEQUENCE ID NO: 341 

SEQUENCE LENGTH: 43 amino acids 

LWLWAGWTVWWSCGPGEKGHGWPSLPTMAL 
! 5 10 15 20 25 30 

LLLRFSCMRVASY 
35 40 

SEQUENCE ID NO: 342 

SEQUENCE LENGTH: 44 amino acids 

GLWLWAGWTVWWSCGPGEKGHGWPSLPTMA 
1 5 10 15 20 25 30 

LLLLRFSCMRVASY 
35 40 

SEQUENCE ID NO: 343 

SEQUENCE LENGTH: 84 amino acids 

GCGCGPAGQYGGAVGLARRGTAGCLPCPPW 

1 5 10 15 20 25 30 

LCCCCAFPACGLPGTDGWRGWQGSGCVRVS 

35 40 45 50 55 60 

GSAPWAPGFPFSPPCPLCGTQPRW 

65 70 75 80 

SEQUENCE ID NO: 344 

SEQUENCE LENGTH: 83 amino acids 

CGCGPAGQYGGAVGLARRGTAGCLPCPPWL 

1 5 10 15 20 25 30 

CCCCAFPACGLPGTDGWRGWQGSGCVRVSG 

35 40 45 50 55 60 

SAPWAPGFPFSPPCPLCGTQPRW 

65 70 75 80 



SEQUENCE ID NO: 345 

SEQUENCE LENGTH: 51 amino acids 



LAFNVPGGLWLWAGWTVWWS CG PGEKGHGW 
1 5 10 15 20 25 30 

PSLPTMALLLLRFSCMRVASY 
35 40 45 50 

SEQUENCE ID NO: 346 

SEQUENCE LENGTH: 52 amino acids 

LAFNVPGGGLWLWAGWTVWWSCGPGEKGHG 
1 5 10 15 20 25 30 

WPSLPTMALLLLRFSCMRVASY 
35 40 45 50 

SEQUENCE ID NO: 347 

SEQUENCE LENGTH: 92 amino acids 

LAFNVPGGGCGCGPAGQYGGAVGLARRGTA 
1 5 10 15 20 25 30 

GCLPCPPWLCCCCAFPACGLPGTDGWRGWQ 
35 40 45 50 55 60 

GSGCVRVSGSAPWAPGFPFSPPCPLCGTQP 
65 7 0 75 80 85 90 

R W 

SEQUENCE ID NO: 348 

SEQUENCE LENGTH: 91 amino acids 

LAFNVPGGCGCGPAGQYGGAVGLARRGTAG 
1 5 10 15 20 25 30 

CLPCPPWLCCCCAFPACGLPGTDGWRGWQG 
35 40 45 50 55 60 

SGCVRVSGSAPWAPGFPFSPPCPLCGTQPR 
65 70 75 80 85 90 

W 

SEQUENCE ID NO: 349 

SEQUENCE LENGTH: 17 amino acids 
PPMPMPGQREAPGRQEA 
15 10 15 



SEQUENCE ID NO: 350 

SEQUENCE LENGTH: 18 amino acids 

GPPMPMPGQREAPGRQEA 

1 5 10 15 

SEQUENCE ID NO: 351 

SEQUENCE LENGTH: 24 amino acids 

GHQCQCQGKGRHRADRRPDTAQEE 
15 10 15 20 



SEQUENCE ID NO: 352 

SEQUENCE LENGTH: 23 amino acids 

HQCQCQGKGRHRADRRPDTAQEE 
15 10 15 20 



SEQUENCE ID NO: 353 

SEQUENCE LENGTH: 25 amino acids 

GGHSYGGGPPMPMPGQREAPGRQEA 
1 5 10 15 20 25 

SEQUENCE ID NO: 354 

SEQUENCE LENGTH: 26 amino acids 

GGHSYGGGGPPMPMPGQREAPGRQEA 
1 5 10 15 20 25 

SEQUENCE ID NO: 355 

SEQUENCE LENGTH: 32 amino acids 

GGHSYGGGGHQCQCQGKGRHRADRRPDTAQ 
15 10 15 20 25 30 

E E 



SEQUENCE ID NO: 356 

SEQUENCE LENGTH: 31 amino acids 

GGHSYGGGHQCQCQGKGRHRADRRPDTAQE 
1 5 10 15 20 25 30 



E 

SEQUENCE ID NO: 357 

SEQUENCE LENGTH: 10 amino acids 

APCPQSSGGG 

15 10 

SEQUENCE ID NO: 358 

SEQUENCE LENGTH: 17 amino acids 
LPAPSQAAADELDRRPG 
15 10 15 

SEQUENCE ID NO: 359 

SEQUENCE LENGTH: 18 amino acids 
TKVRLIRGAPCPQSSGGG 
15 10 15 

SEQUENCE ID NO: 360 

SEQUENCE LENGTH: xx amino acids 

TKVRLIRGGAPCPQSSGGG 

1 5 10 

SEQUENCE ID NO: 361 

SEQUENCE LENGTH: 26 amino acids 

TKVRLIRGGLPAPSQAAADELDRRPG 
1 5 10 15 20 25 

SEQUENCE ID NO: 362 

SEQUENCE LENGTH: 25 amino acids 

TKVRLIRGLPAPSQAAADELDRRPG 
1 5 10 15 20 25 

SEQUENCE ID NO: 363 

SEQUENCE LENGTH: 45 amino acids 

CSLAKDGSTEDTVSSLCGEEDTEDEELEAA 
1 5 10 15 20 25 30 



ASHLNKDLYRELLGG 
35 40 45 

SEQUENCE ID NO: 364 

SEQUENCE LENGTH: 4 6 amino acids 

GCSLAKDGSTEDTVSSLCGEEDTEDEELEA 
1 5 10 15 20 25 30 

AASHLNKDLYRELLGG 
35 40 45 

SEQUENCE ID NO: 365 

SEQUENCE LENGTH: 21 amino acids 
AAAWQKMAPPRT PRPACVARR 
15 10 15 20 

SEQUENCE ID NO: 366 

SEQUENCE LENGTH: 54 amino acids 

ENSRPKRGGCSLAKDGSTEDTVSSLCGEED 
1 5 10 15 20 25 30 

TEDEELEAAASHLNKDLYRELLGG 
35 40 45 50 

SEQUENCE ID NO: 367 

SEQUENCE LENGTH: 55 amino acids 

ENSRPKRGGGCSLAKDGSTEDTVSSLCGEE 
1 5 10 15 20 25 30 

DTEDEELEAAASHLNKDLYRELLGG 

35 40 45 50 55 

SEQUENCE ID NO: 368 

SEQUENCE LENGTH: 30 amino acids 

ENSRPKRGGAAAWQKMAPPRTPRPACVARR 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 369 

SEQUENCE LENGTH: 29 amino acids 



ENSRPKRGAAAWQKMAPPRTPRPACVARR 
1 5 10 15 20 25 

SEQUENCE ID NO: 370 

SEQUENCE LENGTH: 10 amino acids 
HCVLAAS GAS 
15 10 

SEQUENCE ID NO: 371 

SEQUENCE LENGTH: 11 amino acids 
GHCVLAAS GAS 
15 10 

SEQUENCE ID NO: 372 

SEQUENCE LENGTH: 28 amino acids 

GTASSRPLGLPKPHLHRPVP IRHPSCPK 
1 5 10 15 20 25 

SEQUENCE ID NO: 373 

SEQUENCE LENGTH: 27 amino acids 

TASSRPLGLPKPHLHRPVPIRHPSCPK 
1 5 10 15 20 25 

SEQUENCE ID NO: 374 

SEQUENCE LENGTH: 18 amino acids 
AGTLQLGGHCVLAASGAS 
15 10 15 

SEQUENCE ID NO: 375 

SEQUENCE LENGTH: 19 amino acids 
AGTLQLGGGHCVLAASGAS 
15 10 15 



SEQUENCE ID NO: 376 

SEQUENCE LENGTH: 35 amino 
AGTLQLGGTASSR 



acids 

PLGLPKPHLHRPVPIRH 



1 5 10 15 20 25 30 

P S C P K 
35 

SEQUENCE ID NO: 377 

SEQUENCE LENGTH: 36 amino acids 

AGTLQLGGGTASSRPLGLPKPHLHRPVPIR 
1 5 10 15 20 25 30 

H P S C P K 
35 

SEQUENCE ID NO: 378 

SEQUENCE LENGTH: 9 amino acids 

RRTPSTEKR 

15 

SEQUENCE ID NO: 379 

SEQUENCE LENGTH: 10 amino acids 

RRTPSTEKKR 

15 10 

SEQUENCE ID NO: 380 

SEQUENCE LENGTH: 14 amino acids 
RRTPSTEKKGRSEC 
15 10 

SEQUENCE ID NO: 381 

SEQUENCE LENGTH: 13 amino acids 
RRTPSTEKGRSEC 
15 10 

SEQUENCE ID NO: 382 

SEQUENCE LENGTH: 4 6 amino acids 

STTKCQSGTAETYNSWKVKNLQLEPRRVTS 
1 5 10 15 20 25 30 

QMNRQVKDMTAILSQS 



35 40 45 

SEQUENCE ID NO: 383 

SEQUENCE LENGTH: 17 amino acids 

VQPNASQAQQKPTTHGR 

1.5 10 15 

SEQUENCE ID NO: 384 

SEQUENCE LENGTH: 54 amino acids 

SSEEIKKKSTTKCQSGTAETYNSWKVKNLQ 
1 5 10 15 20 25 30 

LEPRRVTSQMNRQVKDMTAILSQS 
35 40 45 50 

SEQUENCE ID NO: 385 

SEQUENCE LENGTH: 55 amino acids 

SSEEIKKKKSTTKCQSGTAETYNSWKVKNL 
15 10 15 20 25 30 

QLEPRRVTSQMNRQVKDMTAILSQS 
35 40 45 50 55 

SEQUENCE ID NO: 386 

SEQUENCE LENGTH: 26 amino acids 

SSEEIKKKKVQPNASQAQQKPTTHGR 
15 10 15 20 25 

SEQUENCE ID NO: 387 

SEQUENCE LENGTH: xx amino acids 

SSEEIKKKVQPNASQAQQKPTTHGR 

1 5 10 15 20 25 

SEQUENCE ID NO: 388 

SEQUENCE LENGTH: 9 amino acids 

NRGWVGAGE 

1 5 

SEQUENCE ID NO: 389 



SEQUENCE LENGTH: 4 amino acids 

I E A G 

1 



SEQUENCE ID NO: 390 

SEQUENCE LENGTH: 17 amino acids 
VHNYCNMKNRGWVGAGE 
1 5 10 15 

SEQUENCE ID NO: 391 

SEQUENCE LENGTH: 18 amino acids 
VHNYCNMKKNRGWVGAGE 
15 10 15 

SEQUENCE ID NO: 392 

SEQUENCE LENGTH: 13 amino acids 
VHNYCNMKKIEAG 
15 10 

SEQUENCE ID NO: 393 
SEQUENCE LENGTH: 12 amino acids 
VHNYCNMK I EAG 
15 10 

SEQUENCE ID NO: 394 

SEQUENCE LENGTH: 25 amino acids 

QLRCWNTWAKMFFMVFLI IWQNTMF 
1 5 10 15 20 25 

SEQUENCE ID NO: 395 

SEQUENCE LENGTH: 33 amino 
VKKDNHKKQLRCW 
15 10 
T M F 



acids 

NTWAKMFFMVFLI IWQN 
15 20 25 30 



• 



SEQUENCE ID NO: 396 

SEQUENCE LENGTH: 34 amino acids 

VKKDNHKKKQLRCWNTWAKMFFMVFL I IWQ 
1 5 10 15 20 25 30 

N T M F 



SEQUENCE ID NO: 397 

SEQUENCE LENGTH: 11 amino acids 
VKKDNHKKKNS 
15 10 

SEQUENCE ID NO: 398 

SEQUENCE LENGTH: 10 amino acids 

VKKDNHKKNS 

15 10 

SEQUENCE ID NO: 399 

SEQUENCE LENGTH: 35 amino acids 

GAEESGPFNRQVQLKVHASGMGRHLWNCPA 
15 10 15 20 25 30 

F W S E V 
35 

SEQUENCE ID NO: 400 

SEQUENCE LENGTH: 10 amino acids 

HPSPPPEKRS 

1 5 10 

SEQUENCE ID NO: 401 

SEQUENCE LENGTH: 11 amino acids 
HPSPPPEKKRS 
15 10 

SEQUENCE ID NO: 402 

SEQUENCE LENGTH: 44 amino acids 



HPSPPPEKKGAEESGPFNRQVQLKVHASGM 
1 5 10 15 20 25 30 

GRHLWNCPAFWSEV 
35 40 



SEQUENCE ID NO: 403 

SEQUENCE LENGTH: 43 amino acids 

HPSPPPEKGAEESGPFNRQVQLKVHASGMG 
I 5 10 15 20 25 3 

RHLWNCPAFWSEV 
35 40 



SEQUENCE ID NO: 404 

SEQUENCE LENGTH: 39 amino acids 

MQVLSKTHMNLFPQVLLQMFLRGLKRLLQD 
! 5 10 15 20 25 3 

LEKSKKRKL 
35 



SEQUENCE ID NO: 405 

SEQUENCE LENGTH: 8 amino acids 

RCKSARLI 

1 5 



SEQUENCE ID NO: 406 

SEQUENCE LENGTH: 48 amino acids 

VQTQPAIKKMQVLSKTHMNLFPQVLLQMFL 
1 5 10 15 20 25 3 

RGLKRLLQDLEKSKKRKL 
35 40 45 

SEQUENCE ID NO: 407 

SEQUENCE LENGTH: 49 amino acids 



VQTQPAIKKKMQVLSKTHMNLFPQVLLQMF 




10 15 20 25 

LRGLKRLLQDLEKSKKRKL 
35 40 45 

SEQUENCE ID NO: 408 

SEQUENCE LENGTH: 17 amino acids 
VQTQPAIKKRCKSARLI 
1 5 10 15 

SEQUENCE ID NO: 409 

SEQUENCE LENGTH: 16 amino acids 
VQTQPAIKRCKSARLI 
1 5 10 15 

SEQUENCE ID NO: 410 

SEQUENCE LENGTH: 11 amino acids 
ARSGKKQKRKL 
15 10 

SEQUENCE ID NO: 411 

SEQUENCE LENGTH: 12 amino acids 

ARSGKKQKKRKL 

'1 5 10 

SEQUENCE ID NO: 412 

SEQUENCE LENGTH: 13 amino acids 

A R S G KKQKKENSF 

1 5 10 

SEQUENCE ID NO: 413 

SEQUENCE LENGTH: 12 amino acids 
ARSGKKQKENS F 



10 



SEQUENCE ID NO: 414 

SEQUENCE LENGTH: 14 amino acids 
KASARSGKSKKRKL 

15 10 

SEQUENCE ID NO: 415 

SEQUENCE LENGTH: 15 amino acids 
KASARSGKKSKKRKL 
1 5 10 15 

SEQUENCE ID NO: 416 

SEQUENCE LENGTH: 16 amino acids 
KASARSGKKAKKENSF 
15 10 15 

SEQUENCE ID NO: 417 

SEQUENCE LENGTH: 15 amino acids 
KASARSGKAKKENSF 
1 5 10 15 

SEQUENCE ID NO: 418 

SEQUENCE LENGTH: 15 amino acids 
HLNKGRRLGDKIRAT 
I 5 10 15 



SEQUENCE ID NO: 419 

SEQUENCE LENGTH: 23 amino acids 

VTSGTPFFHLNKGRRLGDKIRAT 
1 5 10 15 20 



SEQUENCE ID NO: 420 

SEQUENCE LENGTH: 24 amino acids 

VTSGTPFFFHLNKGRRLGDKIRAT 
15 10 15 20 

SEQUENCE ID NO: 421 

SEQUENCE LENGTH: 10 amino acids 

VTSGTPFFFI 

15 10 

SEQUENCE ID NO: 422 

SEQUENCE LENGTH: 9 amino acids 

VTSGTPFFI 

1 5 

SEQUENCE ID NO: 423 

SEQUENCE LENGTH: 51 amino acids 

VTLLYVNTVTLAPNVNMESSRNAHSPATPS 
1 5 10 15 20 25 30 

AKRKDPDLTWGGFVFFFCQFH 
35 40 45 50 

SEQUENCE ID NO: 424 

SEQUENCE LENGTH: 60 amino acids 

KCRCKPNFFVTLLYVNTVTLAPNVNMESSR 
1 5 10 15 20 25 30 

NAHS PATPSAKRKDPDLTWGGFVFFFCQFH 
35 40 45 50 65 60 



SEQUENCE ID NO: 425 




SEQUENCE LENGTH: 61 amino acids 

KCRCKPNFFFVTLLYVNTVTLAPNVNMESS 

1 5 10 15 20 25 31 

RNAHSPATPSAKRKDPDLTWGGFVFFFCQF 

35 40 45 50 65 6 

H 

SEQUENCE ID NO: 426 

SEQUENCE LENGTH: 10 amino acids 

KCRCKPNFFL 

15 10 



SEQUENCE ID NO: 427 

SEQUENCE LENGTH: 9 amino acids 

KCRCKPNFL 

1 5 



SEQUENCE ID NO: 428 

SEQUENCE LENGTH: 9 amino acids 

SLVRLSSCV 

1 5 

SEQUENCE ID NO: 429 

SEQUENCE LENGTH: 14 amino acids 
LVKKLKEKKMNWIL 
15 10 

SEQUENCE ID NO: 430 

SEQUENCE LENGTH: 15 amino acids 
LVKKLKEKKKMNWIL 
15 10 15 



SEQUENCE ID NO: 431 



SEQUENCE LENGTH: 10 amino acids 

LVKKLKEKKR 

15 10 

SEQUENCE ID NO: 432 

SEQUENCE LENGTH: 9 amino acids 

LVKKLKEKR 

1 5 

SEQUENCE ID NO: 433 

SEQUENCE LENGTH: 9 amino acids 

AAIVKDCCR 

1 5 

SEQUENCE ID NO: 434 

SEQUENCE LENGTH: 11 amino acids 

SQPASILGRKL 

1 5 10 

SEQUENCE ID NO: 435 

SEQUENCE LENGTH: xx amino acids 

SQPAS ILGKRKL 

1 5 10 15 

SEQUENCE ID NO: 436 
SEQUENCE LENGTH: 18 amino acids 
SQPAS ILGKAAIVKDCC 
1 5 10 15 

SEQUENCE ID NO: 437 

SEQUENCE LENGTH: 17 amino acids 
SQPAS ILGAAIVKDCCR 
1 5 10 15 
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Fig. 3 
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Fig. 7 



